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CGAGAGACGACAGAAGGTTACGGCTGCGAGAAGACGACAGAAGGGTCCAGAAAAA 

GGAAAGTGCTGGAGGGGAGTGGGGACAAAAGCAGCGACCAAGTGAATGTCACTTC 

AGTGACrGAGGCCAGGCAAAACGCGCGGGAAGGATTTTGTGTAGCTTGGGACCCTT 

TCATAGACACTGATGACACGTTTACGCAAAATAGAAATTTGAGGAGAAACGCCTGG 

GCCTTCGGAAAGGAGTGATTGATTAGTACTTGCAAGTTTAGGTGACTTTAAGGAGAA 

CTAACTAATGTATACTATTGAGGGAGGAGGAAGAGCATTACAGAGTTTCCAGCAGC 

AGCAGGAAAGC1TTGGTTAATTTGGAAATGGATGATAGCATTAAAATAACAGAAGC 

GCCTCCAGGTCTCTGAAGCTTCAGTCCCCCAGCTGAAAGCCAGAAAAGACTAAGCC 

CACTAAGCCITTTGATCCCITTGGAAGCAAAGAACTTTCCTTCCCTGGGGTGAAGAC 

TCTCCtCAGAAGATTTCCTGTCTCTGCCTATGTTACAAGAGGAATCAAAACCAAGAC 

AGAAGAGCTCAGGATGCAGGTGAGAGGCAGGGAAGTCAGCGGCTTGTTGATCTCCC 

TCACTGCTGTCTGCCTGGTGGTCACCCCTGGGAGCAGGGCGTGTCCTCGCCGCTGTG 

CCTGCTATGTGCCCACAGAGGTGCACTGTACATTTCGGTACCTGACCTCCATCCCAG 

ATGGCATCCCGGCCAATGTGGAACGAATAAATTTAGGATATAACAGCCTTACTAGAT 

TGACAGAAAACGACTTTGATGGCCTGAGCAAACTGGAGTTACTGATGCTGCACAGT 

AATGGCATTCACAGAGTCAGTGACAAGACCTTCTCGGGCTTGCAGTCCTTGCAGGTC 

TTAAAAATGAGCTATAACAAAGTCCAAATCATTCGGAAGGATACTTTCTACGGACTC 

GGGAGCTTGGTCCGGTTGCACCTGGATCACAACAACATTGAATTCATCAACCCTGAG 

GCCTTTTATGGACTTACCTCGCTCCGCTTGGTACATTTAGAAGGAAACCGGCTCACA 

AAGCTCCATCCAGACACATTTGTCTCATTAAGCTATCTCCAGATATTTAAAACCTCTT 

TCATTAAGTACCTGTTCTTGTCTGATAACTTCCTGACCTCCCTCCCAAAAGAAATGGT 

CTCCTACATGCCAAACCTAGAAAGCCTGTATTTGCATGGAAACCCATGGACCTGTGA 

CTGCCATTTAAAGTGGTTGTCTGAGTGGATGCAGGGAAACCCAGATATAATAAAAT 

GCAAGAAAGACAGAAGCTCTTCCAGTCCTCAGCAATGTCCCCTTTGCATGAACCCCA 

GGATCTCTAAAGGCAGACCCTTTGCTATGGTACCATCTGGAGCTTTCCTATGTACAA 

AGCCAACCATTGATCCATCACTGAAGTCAAAGAGCCTGGTTACTCAGGAGGACAAT 

GKjATCTGCCTCCACCTCACCTCAAGATTTCATAGAACC 

ACATGACANA>n s rrNTCTGGAAATAAGGCCGACATGGTCTGTAGTATCCAAAAGCCA 

TCAAGGACATCACCAACTGCATTCACTGAAGAAAATGACTACATCATGCTAAATGC 

GTCATTTTCCACAAATCTTGTGTGCAGTGTAGATTATAATCACATCCAGCCAGTGTG 

GCAACTrCTGGCTTTATACAGTGACTCTCCTCTGATACTAGAAAGGAAGCCCCAGCT 

TACCGAGACTCCTTCACTGTCTTCTAGATATAAACAGGTGGCTCTTAGGCCTGAAGA 

CATTITTACCAGCATAGAGGCTGATGTCAGAGCAGACCCTnTTGGTTCCAACAAGA 

AAAAATTGTCTTGCAGCTGAACAGAACTGCCACCACACTTAGCACATTACAGATCCA 

GTTTTCCACTGATGCTCAAATCGCTTTACCAAGGGCGGAGATGAGAGCGGAGAGAC 

TCAAATGGACCATGATCCTGATGATGAACAATCCCAAACTGGAACGCACTGTCCTGG 

TTGGCGGCACTATTGCCCTGAGCTGTCCAGGCAAAGGCGACCCTTCACCTCACTTGG 

AATGGCTTCTAGCTGATGGGAGTAAAGTGAGAGCCCCTTACGtTAGCGAGGATGGG 



cgaatcctaatagacaaaaatgggaagttggaactgcagatggctgacagctttga 

tgcaggtctttaccactgcataagcaccaatgatgcagatgcggatgttctcacata 

caggataactgtggtagagccctatggagaaagcacacatgacagtggagtccagc 

acacagtggttacgggtgagacgctcgaccttccatgk:ctttccacgggtgttccag 

atgcttctattagctggattcttccagggaacactgtgttctctcagccatcaagag 

acaggcaaattcttaacaatgggacctraagaatattacaggttacgccaaaagat 

caaggtcattaccaatgtgtggctck:caacccatcaggck3ccgacttttccag1t^ 

aaagtttcagttcaaaagaaaggccaaaggatggttgagcatgacagggaggcag 

gtggatctggacttggagaacccaactccagtgtttcccttaagcagccagcatctt 

tgaaactctctgcatcack:tttgacagggtcagaggctggaaaacaagtctccggtg 

tacataggaagaacaaacatagagacttaatacatcggcggcgtggggattccacg 

ctccggcgattcagggagcataggaggcagctccctctctctgctcggagaattgac 

ccgcaacgctgggcagcacttctagaaaaagccaaaaagaattctgtgccaaaaaa 

gcaagaaaataccacagtaaagccagtgccactggctgttcccctcgtggaactca 

ctgacgaggaaaaggatgcctctggcatgattcctccagatgaagaattcatggttc 

tgaaaactaaggcttctggtgtcccaggaaggtcaccaacrgctgactctggaccag 

taaatcatggttttatgacgagtatagcttctggcacagaagtctcaactgtgaatc 

cacaaagactacaatctgagcacgttcctgatttcaaattatttagtgtaacaaacg 

gtacagctgtgacaaagagtatgaacccatccatagcaagcaaaatagaagataca 

accaaccaaaacccaatcattatcttrccatcagtagctgaaatrcgagattctgct 

caggcaggaagagcatcttcccaaagtgcacaccctgtaacagggggaaacatggc 

tacctatggccataccaacacatatagtagctttaccagcaaagccagtacagtctt 

gcagccaataaatccaacagaaagttatggacctcagatacctattacaggagtca 

gcagacctagcagtagtgacatctcttctcacactactgcagaccctagcttctcca 

gtcaccctrcaggttcacacaccactgcctcgtcitratttcacattcctagaaacaa 

caatacaggtaacttccccttgtccaggcacttgggaagagagaggacaatttgga 

gcagagggagagttaaaaacccacatagaaccccagttctccgacggcatagacac 

aggactgtgaggccagcaatcaagggacctgctaacaaaaatgtgagccaagttcc 

agccacagagtaccctgggatgtgccacacatgtccttccgcagaggggctcacag 

tggctactgcagcactgtcagttccaagttcatcccacagtgccctccccaaaacta 

ataatgttggggtcatagcagaagagtctaccactgtggtcaagaaaccactgttac 

tatttaaggacaaacaaaatgtagatattgagataataacaaccactacaaaatat 

tccggaggggaaagtaaccacgtgattcctacggaagcaagcatgacttctgctcc 

aacatctgtatccctggggaaatctcctgtagacaatagtggtcacctgagcatgcc 

tgggaccatccaaactgggaaagattcagtggaaacaacaccacttcccagccccc 

tcagcacaccctcaataccaacaagcacaaaattctcaaagaggaaaactcccttg 

caccagatctttgtaaataaccagaagaaggaggggatgttaaagaatccatatca 

attcggtttacaaaagaacccagccgcaaagctrcccaaaatagctcctcttttacc 

cacaggtcagagttccccctcagattctacaactctcttgacaagtccgccaccagc 

tctgtctacaacaatggctgccactcagaacaagggcactgaagtagtatcaggtg 



CCAGAAGTCTCTCAGCAGGGAAGAAGCAGCCCTTCACCAACTCCTCTCCAGTGCTTC 

CTAGCACCATAAGCAAGAGATCTAATACATTAAACTTCTTGTCAACGGAAACCCCCA 

CAGTGACAAGTCCTACTGCTACTGCATCTGTCATTATGTCTGAAACCCAACGAACAA 

GATCCAAAGAAGCAAAAGACCAAATAAAGGGGCCTCGGAAGAACAGAAACAACGC 

AAACACCACCCCCAGGCAGGTTTCTGGCTATAGTGCATACTCAGCTCTAACAACAGC 

TGATACCCCCTTGGCnTCAGTCATTCCCCACGACAAGATGATGGTGGAAATGTAAG 

TGCAGTTGCTTATCACTCAACAACCTCTOT'CTGGCCATAACTGAACTGTTTGAGAA 

GTACACCCAGACTrrGGGAAATACAACAGCTTTGGAAACAACGTTGTTGAGCAAAT 

CACAGGAGAGTACCACAGTGAAAAGAGCCTCAGACACACCACCACCACTCCTCAGC 

AGTGGGGCGCCCCCAGTGCCCACTCCTTCCCCACCTCCTTTTACTAAGGGTGTGGTT 

ACAGACAGCAAAGTCACATCAGCTTTCCAGATGACGTCAAATAGAGTGGTCACCAT 

ATATGAATCTTCAAGGCACAATACAGATCTGCAGCAACCCTCAGCAGAGGCTAGCC 

CCAATCCTGAGATCATAACTGGAACCACTGACTCTCCCTCTAATCTGTTTCCATCCAC 

TTCTGTGCCAGCACTAAGGGTAGATAAACCACAGAATTCTAAATGGAAGCCCTCTCC 

CTGGCCAGAACACAAATATCAGCTCAAGTCATACTCCGAAACCATTGAGAAGGGCA 

AAAGGCCAGCAGTAAGCATGTCCCCCCACCTCAGCCTTCCAGAGGCCAGCACTCAT 

GCCTCACACTGGAATACACAGAAGCATGCAGAAAAGAGTGTTTTTGATAAGAAACC 

TGGTCAAAACCCAACTTCCAAACATCTGCCTTACGTCTCTCTACCTAAGACTCTATTG 

AAAAAGCCAAGAATAATTGGAGGAAAGGCTGCAAGCnTACAGTTCCAGCTAATTC 

AGACGTTTTTCTrCCTTGTGAGGCTGTTGGAGACCCACTGCCCATCATCCACTGGACC 

AGAGTTTCATCAGGANTTGAAATATCCCAAGGGACACAGAAAAGCCGGTTCCACGT 

GCTTCCCAATGGCACCTTGTCCATCCAGAGGGTCAGTATTCAGGACCGTGGACAGTA 

CCTGTGCTCTGCATTTAATCCACTGGGCGTAGACCATTTTCATGTCTCTITGTCTGTG 

GTTTTTTACCCGGCAAGGATTTTGGACAGACATGTCAAGGAGATCACAGTTCACTTT 

GGAAGTACTGTGGAACTAAAGTGCAGAGTGGAGGGTATGCCGAGGCCTACGGTTTC 

CTGGATACTTGCAAACCAAACGGTGGTCTCAGAAACGGCCAAGGGAAGCAGAAAGG 

TCTGGGTAACACCTGATGGAACATTGATCATCTATAA1UTGAGTCTTTATGATCGTG 

GTTTTTACAAGTGTGTGGCCAGCAACCCATCTGGCCAGGATTCACTGTTGGTTAAGA 

TACAAGTCATCACAGCTCCCCCTGTCATTATAGAGCAAAAGAGGCAAGCCATCGTTG 

GGGTTTTAGGTGGAAGTTTGAAACTGCCCTGCACTGCAAAAGGAACTCCCCAGCCTA 

GTGTTCACTGGGTCCrTTATGATGGGACTGAACTAAAACCATrGCAGTTGACTCATT 

CCAGATTlTrCTrGTATCCAAATGGAACTCTGTATATAAGAAGCATCGCTCCTTCAGT 

GAGGGGCACTTATGAGTGCATTGCCACCAGCTCCTCAGGCTCAGAGAGAAGGGTAG 

TGATTCTTACTGTGGAAGAGGGAGAGACAATCCCCAGGATAGAAACTGCCTCTCAG 

AAATGGACTGAGGTGAATTTGGGTGAGAAATTACTACTGAACTGCTCAGCTACTGG 

GGATCCAAAGCCTAGAATAATCTGGAGGCTGCCATCCAAGGCTGTCATCGACCAGT 

GGCACAGAATGGGCAGCCGAATCCACGTCTACCCAAATGGATCCTTGGTGGTTGGG 

TCAGTGACGGAAAAAGACGCTGGTGACTACTTATGTGTGGCAAGAAACAAAATGGG 

AGATGACCTAGTCCTGATGCATGTCCGCCTGAGATTGACACCTGCCAAAATTGAACA 

GAAGCAGTATTTTAAGAAGCAAGTGCTCCATGGGAAAGATTTCCAAGTTGACTGCA 



AGGCCTCTGGCTCCCCTGTGCCTGAGGTATCCTGGAGTTTGCCTGATGGGACAGTGC 

TCAACAATGTAGCCCAAGCTGATGACAGTGGCTATAGGACCAAGAGGTACACCCTT 

TTCCACAATGGAACCTTGTATTTCAACAACGTTGGGATGGCAGAGGAAGGAGATTAT 

ATCTGCTCTGCCCAGAACACCTTAGGGAAAGATGAGATGAAAGTCCACCTAACAGT 

TCTAACAGCCATCCCACGGATAAGGCAAAGCTACAAGACCACCATGAGGCTCAGGG 

CrGGAGAAACAGCTGTCCTTGACTGCGAGGTCACTGGGGAACCGAAGCCCAATGTA 

TTTTGGTTGCTGCCTTCCAACAATGTCATTTCATTCTCCAATGACACKjTTCACATTTC 

ATGCCAATAGAACTTTGTCCATCCATAAAGTGAAACCACTTGACTCTGGGGACTATG 

TGTGCGTAGCTCAGAATCCTAGTGGGGATGACACTAAGACATACAAACTGGACATT 

GTCTCTAAACCTCCATTAATCAATGGCCTGTATGCAAACAAGACTGTTATTAAAGCC 

ACAGCCATTCGGCACTCCAAAAAATACTTTGACTGCAGAGCAGATGGGATCCCATCT 

TCCCAGGTCACGTGGATTATGCCAGGCAATATTTTCCTCCCAGK^CCATACTTTGGA 

AGGAGAGTCACGGTCCATCCAAATGGAACCTTGGAGATGAGGAACATCCGGCTTTC 

TGACTCTGCGGACTTCACCTGTGTGGTTCGGAGCGAGGGAGGAGAGAGTGTGITGGT 

AGTGCAGTTAGAAGTCCTAGAAATGCTGAGAAGACCAACATTCAGAAACCCATTCA 

ACGAAAAAGTCATCGCCCAAGCTGGCAAGCCCGTAGCACTGAACTGCTCTGTGGAT 

GGGAACCCCCCACCTGAAATTACCTGGATCTTACCTGACGGCACACAGTTTGCTAAC 

AGACCACACAATTCCCCGTATCrGATGGCAGGCAATGGCTCTCTCATCCTTTACAAA 

GCAACTCGGAACAAGTCAGGGAAGTATCGCTGTGCAGCCAGGAATAAGGTTGGCTA 

CATCGAGAAACTCATCCTGTTAGAGATTGGGCAGAAGCCAGTCATTCTGACATACGA 

ACCAGGGATGGTGAAGAGCGTCAGTGGGGAACCGTTATCACTGCATTGTGTGTCTG 

ATGGGATCCCCAAGCCAAATGTCAAGTGGACTACACCGGGTGGCCATGTAATCGAC 

AGGCCTCAAGTGGATGGAAAATACATACTGCATGAAAATGGCACGCTGGTCATCAA 

AGCAACAACAGCTCACGACCAAGGAAATTATATCTGTAGGGCTCAAAACAGTGTTG 

GCCAGGCAGTTATTAGCGTGTCAGTGATGGTTGTGGCCTACCCTCCCCGAATCATAA 

ACTACCTACCCAGGAACATGCTCAGGAGGACAGGGGAAGCCATGCAGCTCCACTGT 

GTGGCCTTGGGAATCCCCAAGCCAAAAGTCACCTGGGAGACGCCAAGACACTCCCT 

GCTCTCAAAAGCAACAGCAAGAAAACCCCATAGAAGTGAGATGCTTCACCCACAAG 

GTACGCTGGTCATTCAGAATCTCCAAACCTCGGATTCCGGAGTCTATAAGTGCAGAG 

CrCAGAACCTACTTGGGACTGATTACGCAACAACTTACATCCAGGTACTCTGACAGG 

AAGGGGGAGACTAAAATTCAACAGAAGTCCACATCCACAGGGTTTATTTTTTGGAA 

GAAGTTTAATCAAAGGCAGCCATAGGCATGTAAATGAGTCTGAATACATTTACAGT 

ATTAAATTTACAATGGACATGCGATGAGACTTGTAAATGAAAGCATTGTGAACTGA 

AACCGAGTCTCTGTGGATCTCAAAGCAAACTOTAACTrAAGGCACTTTGATTTTGC 

CAACAAATAATAACAAACATTAAGAGAAAAAAATGATCCACTACGAAATAACAAAC 

GGCTAATGCACCTGAATTCTCAGTAAAAAGACCTTTCTCTCGCTAACAGTTGCCAGC 

TGCCTCGTGTCTGTTTCCTACCAATGTCACAAACATCGCACACAGGGTGAATGGAGT 

CAACGGGAAAGATTAAGTTTGCGGTCTGTGTAAATCTCAATGTACAAATATTCTGTC 

NCTGGTTTATAAACATTTTGATAAAACCGAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAA (SEQ ID NO: 1) 
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MQWGREVSGmSLTAVCLVVTPGSRACPRRCACYVPTEVHCTFRYLTSIPDGIPANVE 
RINLGYNSLTRLTENDIT)GI^KLELLMLHSNGIHRVSDKTFSGLQSLQVLKMSYNKVQn 
RKDTFYGLGSLVRLIiLDHlsnsnEFINPEAFyGLTSLRLVHLEGNRLTmiPDTFVSLSyLQI 
FKTSI^YLFT^DNH.TSLPKEMVSYMPNLESLYLHGNPWTCDCIILKWI^EWMQGNPDI 
IKCKKDRSSSSPQQCPIXMNPRISKGRPFAMWSGAFI^^ 

ASTSPQDFIEPFGSJJSLNMTXXSGNKADMVCSIQKPSRTSPTAFTEENDYIMLNASFSTNL 

VCSVDYNfflQPVWQLXALYSDSPLILEPvKPQLTETPSLSSRYKQVALRPEDIFTSffiADVR 

ADPFVVFQQEKWLQU>JRTATT1^TLQIQFSTDAQIALPRAEMRAERLKWTMILMMNNPK 

I^RTVLVGGTIAI^CPGKGDPSPHLEWLLADGSKVRAPYVSEDGRILIDKNGKLELQMA 

DSFDAGLYHCISTNDADADVLTYR1TVVEPYGESTHDSGVQHTVVTGETLDLPCLSTGV 

PDASISWILPGNTWSQPSPJDRQnJWGTIJtlLQVTPKDQGHYQCVAANPSGADFSSFKV 

SVQKKGQRMVEHDREAGGSGLGEPNSSVSLKQPASLKLSASALTGSEAGKQVSGVHRK 

MOiRDLniRRRGDSTLRRFREimRQLPLSARRIDPQRWAALLEKAKKNSWKKQENTTV 

KPVPLAVPLVELTDEEKDASGMIPPDEEFMVLKTKASGVPGRSPTADSGPVNHGFMTSI 

ASGTEVSTVNPQTLQSEHLPDFKIJSVTNGTAWKSMNPSIASKffiDTTNQNPI^ 

IRDSAQAGRASSQSAIfl>VTGG^MATYGHT^r^YSSFTC^ 

SRPSSSDISSim'ADPSFSSIiPSGSHTTASSIJ^HIPRNNNTGNFFLSRHLGRERTIWSRGRV 

KM'mTPVLJIRHRIlRTVRPAIKGPANKNVSQVPATEYPGMCHTCPSAEGLTVATAALS 

VPS S S HS ALPKTNN VG VIAEESTT V VKKP1X1JFKT)KQN VDIEIITTTTK Y S GGESNHVIPTE 

ASMTSAPTSVSLGKSPVDNSGHLSMPGTIQTGKDSVETTPLPSPLSTPSIPTSTKFSKRKTP 

LHQIFVNNQKKEGMLKNPYQFGLQKNPAAKLPKIAPLLPTGQSSPSDSrTLLTSPPPALST 

TMAATQNKGTEVVSGARSLSAGKXQPFTNSSPVLPSTISKRSNTLNFLSTETPTVTSPTAT 

ASVMSETQRTRSKEAKDQIKGPRKNRNNANnPRQVSGYSAYSALTTADTPLAFSHSP 

RQDDGGNVSAVAYHSTTSLLAITEIJ^KYTQTLGNTTALETTLI^KSQESTTVKRASDTP 

PPLLSSGAPPVPTPSPPPFTKGVVTDSKVTSAFQMTSNRVVTIYESSRIiNTDLQQPSAEAS 

PNPEIITGTTDSPSNLFPSTSVPALRVDKPQNSKWKPSPWPEIiKYQLKSYSETIEKGKRPA 

VSMSPHL^LPEASTHASHWNTQKHAEKSVFDKKFGQNPTSKHLPYVSl^KTLLKKPRnG 

GKAASFTWANSDVFIJ'CEAVGDPIJ'IIHWTRVSSGXEISQGTQKSRFHVLPNGTLSIQRV 

SIQDRGQYLCSAFNPLGVDIiFIiVSLSVVFYPARILDRHVKEITVHFGSTVELKCRVEGMP 

RPTVSWILANQTVVSETAKGSRKVWVTPDGTLnYNLSLYDRGFYKCVASNPSGQDSLL 

VKIQVITAPPVIIEQKRQAIVGVLGGSLKLPCTAKGTPQPSVHWVLYDGTELKPLQLTHS 

RFFLYPN GTL YIRS LAPS VRGT YECI ATS S S GSERR V VELTVEEGETIPRIET A S QKWTE VN 

LGEKLLLNCSATGDPKPRIIWRLPSKAVroQWHRMGSRIHVYPNGSLVVGSVTEKDAGD 

YLCVARNKMGDDLVLMHVRLRLTPAKffiQKQYFKKQVLHGKDFQVDCKASGSPVPEV 

SWSLPDGTVLNNVAQADDSGYRTKRYTU^GTLYFNNVGMAEEGDYICSAQNTLGK 

DEMKVHLTVLTAIPRIRQSYKTTMRLRAGETAVlJJCEVTGEPKPmTWLLPSNNVISFS 

^RFTFHANRTI^IHKVKPLDSGDYVCVAQNPSGDDTKTYKLDIVSKPPLINGLYANKT 

VIKATAIRHSKKYFDCRADGIPSSQVTWIMPGNIFLPAPYFGSRVTVHPNGTI^MRNIRLS 



DSADFTCVVRSEGGESVLVVQLEVLEMLRRITFRNPFNEKVIAQAGKPVALNCSVDGNP 

PPEriTOPDGTQFANRPHNSPYIJ^GNGSLILYKATRNKSGKYRCAARN^ 

DilGQKPVILTYEPGMVKSVSGEPLSLHCVSDGIPKPNVKWTTPGGHVIDRPQVDGKYIL 

HENGTLVIKATTAHDQGNYICRAQNSVGQAVISVSVMVVAYPPRIINYLPRNMLRRTGE 

AMQLHCVALGIPKPKVTWETPRHSLI^KATARKPmSEMLIiPQGTLVIQNLQTSDSGVY 

KCRAQNLLGTDYATTYIQVL (SEQ ID NO: 2) 
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Nael 



IrrrooAAc 



^CCAACCX^GATGCCCCTCAAC^ 
TXX^TGQAATACTACn^AACTTATTAAAAACAACQAClTO 
O^TQGTCTG^aGATCTTGAGTQAGOTAAOXAATCA 
ACrcACTQATAAffrOGCTATTTGTCTAlX3<^ 
TTTTTXnX7IX3TATAATArrAAQATGGAAATTTGCCAGTQCro 
TXnTTC^GTGGATTACXXXSAAGAAAATAATAAGTOAACTOT 



UTTGfiGi 



AOOTQACAAAQTATQ^TGACACAT TOGOAAAAGAOCACAGT^^ 
CIQATTCATATT AACTCX^OTTOtt^Tt^ 
AACTTCACCAQCTICCTACACAQAQCAm 
CGATfAAQCraCACTGAAACATOAGTCTGACTTQTTAATAATCT^ 
TGTAAAGTATGTQACCAAAGAQCATOOTGATGCTATTAACCT^ 
TAATTTTATQGTAGAAATOTC^ACAAGCITOTXSQAQGCrogAA 
GATQOCCTITCAqTTTTOAAAOTAATTCACT^ 
ATTTAACOCXXX^AGTAGCX^TAATTT^^ 
TClTCTAAAAAGACirnfflTATGCATAGGATCTAAATOTAAA 
AAATAGOAGAAACroAOCTAGAATAGQTATAAAATATXmK^TCMCTr^ 
ATAOmTATATAAtUQQTtXKiAAACKiAATATITOAAACATCrAGAAOTA 
AOTAQCOATCCTQG^^AAATACGTACICTAACACACTGCA^ 
GGAGCroAGGTCTGOAAOGCTCGACCrrrCKnTAGA^ 
GACXnTAGTCTOOAAGGAGCTTTGQAAQAATGACAAOCroTAQC 
QATtX3CATAT T1t%!AAGGCAAG70TXX^CATOOQQAf^^ 
CAGTGCTAAOOACXTITGTTOGTOOCACACAGGTCKX^AGO^^ 
TTOACTCTTTAATTCTACX^TTGATCATTAATG^ 
GATATOGAATATTACrAATrrGTACXXlAAqCCAOQCCAOCT^ 
ATlCTQTOOAOOCXttOTC AOQTO aQTT^ 
ATTACCTGAGATOAGrrf^TTTTTCITCACAAGTCTTTT^ 
CTQATC^GTAAGCATATAAAGTGAQAGCAGC^TQAATGTGTTCCA^ 
TOCTATTATAATQTQQAAATAAACTPACTITAA AAAAA 
GAQTAAATCTTOCATAAATATCATCrOCATlTATAAATrATrrrCATAAl^^ 
AAACCTTTAGAAATITrOTTAACACAAAGATCCCT^^ 
TOQTQ<K30CrOAOAGATQGAGCTrAAQAACrrAC^^ 
ATCTQCCTCAAAACATTTATCCCAAATGCTX^TO 
ACAQGGATTAAAC^GTCATTTTAGTtACATTItTrC^ 
CAQGATQTATCAGQATAACATCCAAOCTTAC^AAAQGA^ 
AACAACAACGACAACAAA«X^TAAAAATTATCAOGq<U 
GAAAAAGTATTATATGTTCTX^TroTC^^ 

ATAGACAAAGATOGTITATQCAAOTGGGGATGGCGAAGGATACrrGTAGATTAGAOGACA 

CAAAGCAACSAACTACAGAGTG AAOTAATCCAGAOACTTAATGTATAATATQAGGACiyn'A 

TrrAATAAT TCTATTTAAGATACACAGCAAACOAOTGrrATCTTACTAACAC^ 

ATA<IA<UQAATAAAOTOATAQATA«nTrQT^ 

TCTACACCTOAAACATAOATAACCAACAAAQAQOAAQAG^ 

OOAAGAOTACATTCGAAAGTCAXiACACCATTGTGTAQATOT 

GAAAGTAGCC AAOQAOCTaAAQOQATCTOCAACCCTATAGQTGOAACAACA^ 

AACCAGTACCCOGOAQCrCrrGACTCrAGCnXK^TATATATX^A AAAQATCKKX 

(X^TCACTOGAAAGAQAOGCnCATrOGACriTGCAAACT^ 

ArACCACIgaOCAAAAAGOOOOAGTQO^ 

GQACTTrrXXnATAOCATTOGAAATOTAAATOAGCTA AATAOCTAATA A A A A ATflflA AAA 

A Nbc2817 

AAAAAAAAAAAAAAAAAAAAAGGAAGGTCAGACACCTCACTrCACT^^ 
QCAAACAGAAGGGGAGTCACAAACCCAGGACAAACCACAGTGATI^^ 
TaTrATTGCltyrrariXiTTACCAa^TCATTAOCATA^ 
CTATGACATOTrnTn-ATTCAAGTATATCACATGC^^ 



OCXXX^OCCCCCITITXX^^ 

CACACACACM^CACXCACACACACCrnAtXTKrT^^ 

CACCCAAQCTTAATCXTrrTTCCntXXrroC^ 

CXXXX7IXXXXCTOTTAG(lAflATOOGAGAG^ 

GCCAQOCXXXXXX^CAQACKX^rcXX^^ 

<XbVOOAOCAAACKXX^Q<XXWXXn^^ 

OCCATIt!CCAQAA0C0GAOQCrATAgCTQCGTCACX7KX>XX!GQ^ 

ACCCCTA0A0O<nXK0CA0AaXX^AA00 

QAGCQOCTOCTAATCCXXTCGOgAAGTOAAOOOACCCAOA^ 

GCTQQQOTTCA<KKKXU0ACAqA0<3OTCKKUT0G0C AGAAG<SW^^ 
ACTCGAGGGGAGTTGWAOUAAGCMX^CCA^^ 
GGCAAAAGGAG<XJGGAAGGATTATATGTAGCCIWXiAOGCrITCATA^ 
TITJlCCAAAGCAAiGK^ATrrGAGGAGAAACC^^ 

AGTACTTGTAAGTTTAGG TG AGTTTC AGAACTAACTAACCTATACT ATTGAG<J<3AGAAGGAAG 
AOCATTCCAGCMlXXGCkQCAlQCAGC^^ 
AITOAIXUTACCATTAAAATAACAGAAGCCO^^ 
<ntJAAGACTAAGCCTACTAAG<XTrrr(MnKXXXnTGC 

TCAACCCTCTOCTCACAAGAV1 lUJ'l^lCrCTCk^ATtnTACAAGAG^ATTCAAAAGCAAOAC 
AGAAGACCK^ COTATTG<XAACTCTTT 
4 Nw4203 

TOTTAAATACAOTITGAOqcrTAAGTOTACO^ 

CITCTt3TATAACTAACTCTAAOOTG<^^^ 

GATCTTTOTTTATCTAATAATAflCAAGCTCATCT^ 

CAAAAATATTCAGTGATffrAACAGTGACAGTGTA^ 

TTAATTTOGGTTAAACTCATTCATAACAGCT<X^GG^ 

CACOTGCO(XriTAAAOATATITrCTAACAAGA<UAgCA 

CCATCACTQTgirAGTAAGCAQAOOGOTGTTTXXL^GCA 

TGCCTOC^AAfiTCAGAQACTCAG<XOTCCCAGCI^ 

AOTTGQCrrTOAAAAGOCCCATGTGTCCT 

GAAATTTGAOCTITriTrCTAGTGQOTTATTAf^GCT^ 

<X1AAATAAATAAAACACATATTAAATAATACAATGTTAG<AAAATTOATC^TATAAAGAA 

OTACATIX^TAAAlTX^TGTQAAAAATA^ 

TTTICTAAATI^CATOTACTATTTTTATATITA TAA 

ACmn-AAAA( WCTCTTG<m^AACAAOAQAQTTAA^ 

AACATTATTATQATCTtrTAATOAC^G<K3ATCCroCTTTTC 

AAGATATGTTTACAATAATAAGCOCArrTAGroATAAAGTCX^TGOOAAGCT 

ACTG<?ITrATAAAGAGAACAGTTTCCroAGTCrATOGAAGTTTACA 

AOrrOCTCTTTCKXAGATTTCACTAG^ 

CATTATAAGTQOAATAAQATAGGATCTCAAAGOAATOTCAATn^ 
CTTT^OOTCnTCTTT^ 

TAAITnTITOCAOTTCnATOGAATTCCAAAAAACA 

ACCTCIXlAAACTAGAACTACCAATCCATTACTGOqTATGTAACAAAGA 

OAATTTATrGCTACATTGTTCATTATTCAtXlACAOCCAAQAATOTGGA^ 

GCCGTCAAAATATGAAOGOATA AAGAAAATGTGGAAATGTGTAC^CAGAGTXXXIATGTO 

OCCATAAAAOAGTOAAATCA TOACATATGCA<XlAAATGQATt^ 

<X3CTAATCAA AACAAQACAGACTCAAAAAQQAAACACCQTOTAGCTT^ 

QAAGCTAGATTTA(>Crn7rACOTGCGCATGTGTQTTTAGAATm 

ATTtnAATXTOTOAOTOTtnAT^ 

TOQGTAOGAGAGAAAGCAATGAGAGGAGTTAATAAGAACGAAGCATAOTAACATAGGTG 
CCAOOATOAAATGCATTAArnOTATQCTAACT AAAC^^ 
AACXlAGGGTOAAATCCCAGCACAQAGAAG<yX}AAOTA^ 
CAAOAAOCCATTrOCAGTTGCTGCCTOCTGGQAQOOOOQ^ 
ACTOTCTATAACAACX^OTTOAC^ 
TTTCATIlTirritTlTACTQlTntnTOTTg^^ 
ATTTGTTTCTTTTGAQAGAQAQAAQQAACATQAAAITGQOTQGOT 
ATCrGGAAGAAQTrOOGGAAAGAOAAAAATrGTATGGAGOATAT^ 
AACAAACAAAAGGTTCATTTTGCCACAAAAACX^^ 
CTTAAAGAAAATATTCXX^AATTATrCXSCAOAGTTQCTATGTAT 



CTTOAACTOCKXXrrA^ 
GTCACW^CCITOCrGAT^^ 

TtXATCXXAGACOHHX^ 

AAACACCTGTGATTACTTITAATAGTTTIX^^ 

TGGAGACAATGAAjCATTTTATCXXIAATAG^ 

GTTTGGGAGCXntXATAATTATGGGGCA^ 

ACAGTTTCTGTTAACAATCATrATGAT^^ 

AAAAG<^TTlXOTITOCAAriTATAAAT^ 

CCTTACTAAATCTOACAAAmTGa 

ATAAATOCACAAGAAAKKKUTATQC^TCACCT^ 

ATGCCAAA<^CTATTX3T^ 

TATAACCXjTTATACCC^OTGCCntKL^ 

CTCCCTATOAOGOAGG<K3CCI^^ 

GGCAGGTTTTATOATAACTTGACXX^GCT 

GAOAACATGCCTQMGAAC^TCAQATTATAGGCAGGO^ 

TGATTCATG<XXX^G<3GO^ 

AAAAAAAAACAGGCTGAGCAAGTGTC^GGAGCAAGTC^ 

TGTOTQCATCAGCnxnt^^ 

CAGTGATOAACAATOATGTOGAACfrATAAG^ 

TTGGTCATOATGTTTC^T^ 

CClTKKUCTAAATKnxnTATCTAT^ 

CTACACCAGAGIOTAATCTGGGCC^ 

GGGGCTTCTTAA(XXK5AACACAGTTOT 

Exwi3(Nuc$0$9)4 

TGTGGACTITAATTGACXM.TTCATCTQAAGCT 

CrCACTAGATOMCA<MAAATCA<^^ 

G<^TGGCATTCACAC^GTCAGTXUCAAGACC^ 

Noc8218 4 

(HtUGATAGCn^AGAGGGTGATGGAGGCraA 

AAAAGCTXKnXiATTCCOGTGGGAGACATK 

TCAAGGAAGCtATTTCAATIGTAATGCAAT^ 

CTOGGGAAAATGCrrACACAATtGCGAQGAOT 

AAAGCACAGCATOGCGGCAGACACCTGGAATTC^ 

CCCAGAGACTCATTGGCCAAACACICT 

ACrTOACTCAOAAACTAATffrXKJ^ 

CATGCATGAATAACKmTCCCAGAQAGAAGGGAAC^ 

AAdGAAGGAAGOMGGAAGGAAGGAAGGAAGGAAGAGACWa^^ 

AAGGAAGOAAAGGGAAAGGAAAAAAGAGATGGGGAGGGAGGGAAGGAAAG 

GGAGAAAGAAGAGAAGAAAGGAAAATAAATAAATTTTCAGGGAT^^ 

TTTTATC£ATAAAAGGTCATTTCCA 

AAGGGCAGTCTITAAGATAGTAGCATTITAT^ 

TTTTTATCXnCTACCATCTrcAAACTG 

TOTAAGAGTITrCACACACCGAAGT^^ 

MT0KX3GAGL^ACAAAAAGAGAC1XX^ 

TTOAGAAGCTAAirGAOGCrcX^TG^ 

ATGAAAAG<^C^TAAAGTACATATGra^ 

ATAATAGCAATCAAAQCAATAAC^CXXa^ 

ACTATGTUIAOTGTTrTOACrrrAGGGT 

ATTTAGGTCATCAGACAGCATTTGGAACATGTATC^ 

QCATQAAATTCIXXA(XTATGTITAT^^ 

GGGTTTX3AGOTTn , ACTCOCAGGTAACATITAQGG 

CTmAmATGAGGGATGTTGGTATTCCA 

GGTCAAAACAGAAAAATATGTAAAAAGTrATTGTTrr^ 

TTTTAAAAATGOTATGCITAGAACTAATrAAGAlTAGAT^^ 

AGAGAGGGATTTGATGtAATGCTAAAGCATCATGA 

CAGAATCAATTAAATTCATATTACTATAAAAGACAGCACGCCAG^ 

GAGTGGATAAACTGTOTAACGTGAGTGCTATO^ ~ 



ACATATTQCATCKnTATGlTrATATaAAATGTrT 

OTAAOAIXmACTTAG^^ 

ATCTTAATTTOCXK3AAAAGACAATTTCCT 

ATOCCTG1X30ATAT1X3TAATAAAOCAGCATO 

GCAGG^GGAAGTGAAATGAGATCKnAGA 

GGTGGAATCTAGATATGTTAATATATTGACAT^ 

AAAA(XAACAQOAGTGAGGOAGACAATACK3AACCAA 

TOTGTCnXK3AQACAOCATAATAAAACnX^ 111110111 O CTAACTAAAACCACTAAAATC 

TAAAAACAAAACATTTTTQCACAAQAATTAriTATTATTCAAT 

GAAGTTCAAGTTCATTGATAGTCFCATAAATUI^ 

TTTATrATTAATTACTCTTG^^ 

ATGCTTTTCACGATACACAAATOTAOGCAGAA^ 

GGAOAGGGAMCKS^^ 

ATOAACAAOAAAAAATGTGCLVCACATGCAQUA 

TATTCTAATATrAAOAAAATAATAAACXXAll IClUllKACrTATCACCCAOQCrCAACAQ 

ITATCITGGOCAC^GAIXXTOTXJr^ 

AATCCAATCCAGGG<^TGATGCT 

AAGTAACCATGACATCTCTC^^ 

CTATGTTCTAATITCrCTAA 

TGGAGTTMGGIXXX^TAGGATTAATCT^ 

E»n4?CH«elll804 r 

TTCirCUAACnTCXnTItjAGAAA^ 

AGClATAACAAAOTCCAAATAATItMCAAGGAtACITO 

TTCK&CafCCAX^^ 

ACXXATWACCTGTGACTGOC^TTT^ 

Nuc 11*80 f 

MjiAT vrixsLTixjiYimnvmw 

TATCXX^UVAAaTCCOOCATAAOCTOXXXXX^ 
CCCKKKATTCCCC^^ 
AGTGATGGCCAACTAGGCCATCTITTGATAC^^ 
TACTCWmcnTCATAATOTTXnT^ 

Nk1W7^ 

(SEQIDNO:3) 



Figure 10 



cDKX_rat 
g$noaic_Hu 

groomloj&u 

gtsAoSicUhti 

cDNfcjtat 
gignpcuio 

geaomic^hu 
cOMfcjnt 

gsAomic^hu 

gaaotttc_hu 
cDNX rat 

gonomicjhu 

gpsnoslc^htt 

cDKkrat 
gonomic fan 

<ranoolajm 
cDH&jrat 



5! 
10 
15 
20 

25i 
30 
35: 
40; 
45: 
50: 
S5 
60: 
65: 
70: 
75 
80: 
85 
90 



ogagagacgacagaaggttacggctgcgagaagacgaoagaagggtccag 



aaaaaggaaagtgctggaggggagtggggacaaaagoagcgaetaagtga 



atgtcacttcagtgactgaggccaggcaaaaogogcgggaaggattttgt 



gtagcttgggaccctttcatagacactgatgacacgtttaegoaaaatag 



aaatttgaggagaaaogccbgggccttcggaaaggagtgattgattagta 



cttgcaagtttaggtgactttaaggagaactaactaatgtatactattga 



gggaggaggaagagcattacagagtttccagcagcagcaggaaagctttg 



gttaatttggaaatggatgatagcattaaaataacagaagcgeetceagg 



tctotgaagottoagtcccoosgctgaaagocagaaaagactaagocoao 



taagacttttgatccctittggaagcaaagaactttccttccctggggtga 



agactotcot^agaagatttcctgtctctgcctatgttacaagaggaatc 



aaaaceaagaca$aaga$ctcaggatgcaggtgagaggcagggaagtcag 



oggcttgttgatotoccteaotgotgtctgcotggtggtcacocetggga 



gcagggcotgtootogcc?gctgtgoctgctatgtgcccacagaggtgcac 



tgtacatttcggtacctgacctccatccoagatggoatcccggooaatgt 



ggaacgaataaatttaggatataacagcottactagattgacagaaaacg 



aotttgatggcotgagoaaactggagttaotcatgotgoaoagtaatggo 



attcaeagagteagtgaeaagaecttctcgggcttgcagtecttgcaggt 



ottaaaaatgagotataaoaaagtcoaaatoattoggaaggatactttct 



CDNXjrat 951 

qqtlqkLo bu 1 

c&t&rat 1001 

9enoiilo_htt 1 

cOHA w rat 1051 

geao&io_fra 1 

cfitl&x&t 1101 

otii c^Jhn 1 

cDNA^rat 1151 

gvuotilcjhu l 

cDN&^rat 1201 

%B cDNAjeat * 1251 

■tQ genoSioJxii 1 

^ " c08A_ret 1301 

H g eivom lcjhu 1 

tii ~ 

*; 3 <»»jeafc 1351 

^ geacSic fcu 1 

ti 

s eDMA_rat 1401 

r s - 

aDtO^rat 1451 

Q genovicjbn 1 

MHfcjwt 1501 

CJ gonom±q__hu 1 

r <30Nfc*at 15S1 

cpanoa[ia_htx 1 

eDM&jrat 1601 

9@tX0&l0 _tlU 1 

dB&rat 1651 

gonoai o__hu 1 

cDNX_rat 1701 

genoaiejra 1 

dDNAjcat 1751 

0oao*io &tt 1 

cDNA_rat 1801 

ge&£*ic_Hu 1 

dDN&je&fc 1$51 

geno»£G_Hti 1 

ct&Kjcat 1901 

genomic Jh» 1 

ct*Gl_rat 1951 

ganoaic lsn 1 



acggactcgggagcttggtccggttgcacetggateacaacaacattgaa 



tteatcaaccetgaggecttttatggacfctaectagctccgcttggtaca 



tttagaaggaaaccggctcacaaagctccatccagacacatttgtctcat 



taagotatctccagatatttaaaacctctttcattaagtacctgttcttg 



tctgataacttectgacctccctccoaaaagaaatggtctootaoatgcc 



aaacotagaaagcctgtatttgcatggaaacccatggacctgtgactgcc 



atttaaagtggttgtotgagtggatgoagggaaaocoagatataataaaa 



tgcaagaaagaoagaagotcttooagtcctoago&atgteccctttgcat _ 



gaaccccaggatcitctaaaggcagaccctttgctatggtaooatotggag 



ctttcctatgtacaaagccaaccattgatccatcactgaagtoaaagagc 



ctggttactcaggaggacaatggatctgcctccacctcacctcaagattt 



catagaacQotttggotocttgtctttgaaeatgacanaantxitctggaa 



ataaggccgacatggtctgtagtatooaaaagccatcaaggacatcacca 



actgcattcactgaagaaaatgaetacatcatgcta&atgegtcattttc 



oacaaatottgtgtgcagtgta^attataatcagatcc&gccagtgtggc 



aaottctggctttatacagtgactotcctotgatactagaaaggaagece 



cagcttaccgagactccttcactgtcttctagatataaaoaggtggotot 



taggoctgaagacatttttaccagcatagaggctgatgtcagagcagacc 



ctttttggttccaacaagaaaaaattgtcttgcagotgaacagaactgcc 



accacacttagoacattacagatooagttttooactgatgctcaaatcgc 



tttacca&gggcggagafcgagagcggagagactcaaatggaooatgatoc 



cDNA_rat 

geAOttiO m JtCO, 

cOHAjrat 

cDHA_rat 
geao«Lo_fcti 

gO!16JLi6jra 

eOK&_rat 
gonoslo_liti 

cDNX_rat 
g900fci.C_hU 

gOftOttiXS _Jbu 

cDNX_rat 
gtttos&c hu 

greooaiojhn 

dDN&jrat 
renewal cJim 

goaottic_hu 

cDKArat 
gco o i ni. 0 

cPH\_rat 
g$noaic_hu 

gs&OBLio^jbn 

genoaie_hti 

ganomic_hu 

cDH\_rat . 
genoni.c_bu 

cONA_rat 



200 

205 

210 

215 

220 

225 

230: 

235 

240: 

245 

250 

255 

260 

2651 

270: 

275; 

280 

295 

290 

205: 



tgatgatgaacaatoaoa&aotggaacgcactgtoctggttggeggcaet 



attgccotgagctgtccaggoaaaggcgaoccttcacctcacttggaatg 
gottctagctgatgggagtaaagtgagagccccttacgttagcgaggatg 
ggcgaatcctaatagacaaaaatgggaagttggaactgcagatggctgac 
agctttgatgoaggtotttaccactgcataagcaccaatgiitgoagatgc" 
ggatgttctcacatacaggataactgtggtagagcccta_tggagaaagca 
cacatgacagtggagtwagoacacagtggttacgggtgagacgctcgao 
cttcoatgcctttccacgggtgttccagatgct*totattagotggattct 
tccagggaacactgtgttctrtcagcoatcaagagacaggeaaattctta 

acaatgggaccttaagaatattacaggttacgocaaaagatoaaggtcat 

taccaatgtgtggctgccaacooatoaggggccgacttttocagttttaa 

agtttcagttcaaaagaaaggccaaaggatggttgagcatgacagggagg 

caggtggatctggacttggagaaoooaactccagtgtttcccttaagcag 

coagcatctttgaaactctctgoatcagctttgacagggtoagaggctgg 

aaaacaagtctccggtgtacataggaagaacaaaeatagagacttaatao 

atcggcggcgtggggattccacgctccggcgattcagggagoataggagg 

cagctccctototctgcfccggagaattgacccgoaacgctgggoagcact 

tctagaaaaagccaaaaagaattctgtgccaaaaaageaagaaaatacca 

cagtaaagccagtgccactggctgttoccctcgtggaactcactgaogag 

gaaaaggatgcctctggoatgattcctccagatgaagaattcatggttct 



t 



genomic_ha 

genonio^hu 
cOKXjrat 

cDK&_rat 
gaaoaicjtm 

cDNA_rat 
gaAoai.o_lra 

cDKA_rat 
geno«ic_hu 

• oDHA_rat 

g <fflOttl C_Jhtt 

g^n/mJiCy Ira 

gttXLOMLo^ltfx 
c&K&jrat 

genomic_hu 

ganoslc_liu 

ge&o&lcjhtt 
cDK\jr*t 

gonomicjbu 

gd&OftiOjlU 
genomic_hn 
gondii o_hu 

gen<mi.e_hu 

eDHA^rat 
genoaio_hti 



300: 

305: 

310 

315: 

320 

329 

330 

335 

340 

345 

350: 

355! 

360 

355: 

370 

375: 

3ao 

385 

3*0: 
335: 
400 
405. 
410 



gaaaact&aggcttctggligtcooaggaaggtcaccaactgctgactctg 



gaceagtaaatcatggttttatgacgagtatagettctggcacagaagtc 



tc^actgtgaatooacaaacactaoaatctgagcaccttootgatttcaa 



attatttagtgta acaaa cggtecagctgtgaoaaagagtatgaacccat 



coatagcaagcaaaat&gaagatacaacoaaooaaaacccaatcattatc 



tttccatcagtagctgaaattogagattctgctcaggcaggaagagcato 



ttcccaaagtgcaoacoctgtaaeagggggaaacatggctacctatggcQ 



atacoaaoacatatagtagctttaccagcaaagccagtacagtictfcgeag 



ccaataaatccaaoagaaagttatggacctcegatacctattaoaggagt 



cagcagacotagcagtagtgacatotcttctcacactactgoagac^cta 



gcttctooagtoacccttcaggttcacacaccactgcotogtctttattt 



oaoattcctagaaaoaacaataoaggtaacttoooottgtccaggcactt 



gggaagagagaggacaatttggagcagagggagagttaaaaacccacata 



gaaecccagttotccgacggcatagao&caggactgtgaggccagcaatc 



aagggacotgctaacaaaaatgtgagccaagttccageeacagagtacca 



tgggatgtgccacacatgtccttccgcagaggggctcaoagtggctacfcg 



cagcaetgtcagttcoaagttcatcccaQagtgoootccccaaaactaat 



aatgttggggtoatagcagaagagtctaccactgtggtcaagaaacoaot 



gttactatttaaggaoaa&caaaatgtagatattgtfgataataacaacca 



ctacaaaatattocggaggggaaagtaaccaogtgattoctacggaagca 



agcatgacttctgctccaacatctgtatccctggggaaatetcctgtaga 



caatagtggtcacctgagcatgcctgggaccatccaaactgggaaagatt 



cagtggaaacaacaccacttcccagccccctcagcacaccctcaatacoa 



gonoadLc^fcv 

garwiio_bu 

cCN&^rat 
gcttonio hu 

go&osJLG 

<sDNA_rat 

cDNA_rat 
gcnps^c lui 

g€fiO ! &l.c sa 2ro 
cDH2\_rat 

cDHfcrat 
genonlcMni 

cDNA_rat 

<3DH\_rat 
g^noadcJKu 

geaoAic_hti 

<SDKA»t 
geaoBio bo 

CfDHXjcat 
gcnowic bu 

ODHAjrat 

cDHXrat 
genoslc fau 

cDNArat 
genomic fcu 



415 
420 
425 

430: 

435! 

440: 

445 

4S0 

455 

460 

465 

470 

475; 

460 

485: 

490 

495 

500 

505; 

510: 

sis: 

620 



aeaag^<^aaattctcaaagaggaaaactccottgc%ccagatetttgt 
aaataaccagaagaaggaggggatgttaaagraatceatatoaattoggtt 
tacaaaagaacccagccgcaaagcttcccaaaatagctcctcttttaccc 
aoaggtcagagttecccctcagattotacaaotctcttgacaagtccgoc 
accagctctgtctaoaaoaatggotgccactcagaacaagggcactgaeg 
tagtatoag^ccagaagtctctcagcagggaagaagcagcccttoaoo 
aaotcctotccagtgcttcetageaecataagoaagagatctaatacatt 
aaacttcttgtcaacggaaacccooaoagtgacaagtcctactgctactg 
catetgteattatgtctgaaacccaaegaacaagatccaaagaagoaaaa 
gaccaaataaaggggcctcggaagaacagaaacaacgcaaacaccaoooc 

caggoaggtttctggctatagtgoatactcagctctaacaacagotgata 

ooocottggctttcagtcattoocoacgacaagatgatggt^gaaatgta 

agtgoagttgcttatcactcaacaaootctcttctggccataactgaact 

gtttgagaagtaoacccagaotttgg^aaataoaaoagctttggaaacaa 

cgttgttgagcaaatcacaggagagtaooacagtgaaaagagcctcagao 

acacoaocaccactcctcagcagtggggogcccccagtgoooactcctto 

cccacctccttttactaagggtgtggttacagacagcaaagtcaoatcag 

ctttccagat^aogtcaaatagagtggtcaccatatatgaatcttcaagg 

cacaatacagatctgcagoaaccctcagcagaggotagccccaatoctga 

gatcataactggaaccactgactptoootctaatctgtttccatccactt 

ctflrtgccagcactaagggtagataaacoacagaattctaaatggaagcco 

totccctggccagaacacaaatatcagotoaagtcatactccgaaaccat 



cDNA_rat 

cDN\_rat 
-gooonlc hu 

go**>mie_hu 

cD*Ut_rat 
genoaic^hti 

go&ftaiojui 
aDHX.rat 

oEWfc^rat 

genoniejiu 

dDFA_rat 
genomic_hn 

cDN&xafc 
cONA_rat 

geneEic_litt 

gQ&o&lc_hu 

cDK\_rat 
gaaonio_hu 

geno»ic_hu 

oDKfcjeat 
go&o&lo Jin 

cONArat 
genoSiioJhu 

cDNA_rat 
g&ttonJtC 

<S3H\jcat 
gonosi o tin 

cDKfcjcftt 
ganc>m±c_hx* 



525 
530: 
535 
540 
545 
550 
555 
560 
565 
570 
57£> 
580 
BBS 
590 
595i 

eoo 

605 
610 
615 
620 
625 



tgag&agggcaaaaggccagoagtaagcatgtccccccacctcagccttc 
cagaggooagoactcatgcctcacactggaatacacagaagcatgcagaa 
aagagtgtttttgataagaaacctggtcaaaaoocaaottecaaacatct 
gccttaegtctctctacctaagactotattgaaaaageeaagaataattg 
gaggaaaggotgc*agctttacagttccagctaattcagaogtttttctt 
ccttgtgaggotgttggagaeccactgcccatcatccactggaooagagt 
ttcateagganttgaaatatcccaagggacacagaaaagccggttccacg 
tgottoooaatggcacctrgtccatccagagggteagtattcaggaccgt 
ggacagtacctgtgct otgca tttaa tcoaotgggtgtaga ccattttca 
tgtotctttgtctgtggttttttacocggcaaggattttggacagacatg 
tcaaggagatuacagttcactttggaagtactgtggaaotaaagtgcaga 
gtggagggtatgccgaggcctacggtttectggatacttgcaaaccaaac 

ggtggtctcagaaacggccaagggaagcagaaaggtctgggtaaoacctg 

r 

atggeacattgatoatctataatctgagtctttatgatcgtggtttttac 
aagltgtgtggccagcaacccatctggccaggattcactgttggttaagat 
acaagtcatcacagctccccctgtcattatagagcaaaagaggeaagcea 
tcgttggggttttaggtggaagtttgaaaotgocotgcactgeaaaagga 
actceceagcctagtgttcactgggtcctttatgatgggactgaactaaa 
aceattgcagttgactcattccagatttttcttgtatccaaatggaactc 
tgtatataagaagcatogotccttcagtgaggggcacttatgagtgcatt 
gccaceagctcetcaggctcagagagaagggtagtgattcttactgtgga 



tat 7351 CttTGGAAGCAGAGTCACGCTCCATCCAAAXOGAXOCTTt^SAgatgagga 

te_h» 890 Ct aTGGAAGCAGAATCACAGTCCATAAAAATGGAACCTTGGAaa t tagga 

rat 7401 acatccGGCTTTCrTGACTCTGCG^ 

♦cjwi 940 atgtgaGGCTTT(^6ATTCAGCci3ACTl!rATCTGTGT<5Gcccgaa»tgaa 

rat 7451 ggaGGAQAGAGTGTGTTWTAQTOC^TTAGAA^TC^TAGAAATGCTGAG 

,e_hu 990 ggt GGAGAGAGCGTGTTGGTAGTACAGTTAGAAGTACT GGAAATGCTGAG 

rat 7501 AAGACCAACAITCAGAAACCCATTC^OSAJUA 

.ejm 1040 AA^CCX^CATTTAGAAATCCATTTAATGAAAAaatagttgcccagctgg 

rat 7551 gcaagcc^gtAGCACTGAACTGCTCTGTOGATGG^ 

.<3_htt 1090 yaaagtccacAGC&TTGAATTGCTCTGT?^ 

rat 7601 attacCTGGATCTTACCTGACGGCACACAGTTTGCTA^ 

.0 Ira 1140 ataatCTGGATTTTACCAAATQGCACACC^tTTTCCAATGCiACCAcaaag 



Hi 

en 

hi 



3 : 



rat 7651 tt ccccGTATCTGATGGCAGGCAATGGCTCTCTCAT CCTTTACAAAg caa 

,cjra 1190 ttatcaCTATCT(^TA<XAAGC^TGGTTCTTTTATGATTTCTAAAaoaa 

rat 7701 otcggaaca»gtCA£&^<^AT<^^ 

;cjra 1240 ct cgggaggat gCAGGAAAATATCGCTCTGOAGCT AGGftATAAAGTTGGC 

rat 7751 TAcatcgagaaactoatootgttagagATTGGGCAGAAGCCAGTCATTCT 

.qjra 1290 TAtattgagaaattagtoatattagaaATTGGCCAGAAGCCAGTTATTCT 

rat 7001 GACATACGAACCAGGGAtggtgaagagcgTCAGTGGGOAACCGTTATCAC 

.ojra 1340 TACCTATGCACXSW3GGAoagtaaaaggoaTC3^^ 

rat 7851 TGCATTGTGTGT CTGAT GGgATCCCC^G<XJUATGTCAAGTGGACTACA 

.C bt* 1390 TGCATTGTGTOTCTGATGGaATCCCTAAGCClAAATATCAAATGGACTATG 
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tat 7901 CCGGGTGGccATGTAATCGACAGGCCTC^ 

iejxtt 1440 CCAAGTGG ttATGTAOTAi!^CAG<^CTCAAATTAATGGGAAATACATAt T 

tat 7951 GCATGAAAATGGCACGCTGGTCATCAAAGCAACAACAGCtca 

Lqjra 1490 GCATG AC^TGGCACCTTAGTCATTAXAGAAGCAACA a cagAG 

rat 8001 GAAATTATATCTGTAGGGCTCAAAACAGTGTTGGCCAggcagttattago 

.cjiu 1540 GAAACTATATCTGTAAGGCTCAAAATAGTGTTGGTCAtacaotgatta ct 

rat 8051 gtgtCAGTGATTOTTCTGGCCTACCCTCCCCGAAICATAAAct acotACC 

.ojra 1590 gttcCAGTAATGATTGTAG<X?AeCCTC^ 

rat 0101 CAGGAACATGCTCAGGAGGACAGGGCAAGCCatgCAGCTCCACTGTGT(^ 

>c Ho 1640 CAGGAGTATTGTCACCAGGACAGGGGCAGCCtttCAGCTCC^ 

»t 8151 CCTTGGGAATCWC^GCXSAaAAGTCACCTGGGAGACGCCAAGAGACTCC 

.cjm 1690 CCTTGGGAGT T CCCAAGCCAgAAATCACATGGGAGATGCCT GACCACTCC 

rat 8201 CTGCTCTCAAaagcaacagcaagaaaacccCATAGAAGTGAGATGCTTCA 

.<^h» 1740 CTTCTCTC3AeggoaagtaaagagaggaoaCATGGAAGTGAGCAGCTTCA 

rat 8251 CCCAOUlGGTACGCTgGTCATtCAGAATCTCCAAAOT 

.©Jm 1790 COTAC^GGTACCCTaGTCATTCftGAA^ 

rat 8301 tcTATAAGTGCAGAGCTCAGAACCTACTTGGgacTGATTACGCAACAACT 

.o Ira 1840 taTACAAATGCACAGCAAAGAACCCACXTGGtagTGATTATGCAGGAACG 



rat 8351 TACATCCAGGTACTCTGACAGGAAgggggagactaaaattoaacagaagt 
iO fan 1890 TATATTCAAGTAATCTGACATGAAataataaagtcaacaacatotgggca 
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ATGAAGGTAAAAGGCAGAGGAATCACCTGCTTGCTGGTCTCCTTTGCTGTGATCTGCCTGGTCGCCACC 
CCTGGGGGCAAGGCCTGTCCTCGCCGCTGTGCCTGTTATATGCCTACGGAGGTACACTGCACATTTCGG 
TACCTGACTTCCATCCCAGACAGCATCCCGCCCAATGTGGAACGCATCAATTTAGGATACAACAGCTTG 
GTTAGATTGATGGAAACAGATTTTTCTGGCCTGACCAAACTGGAGTTACTCATGCTTCACAGCAATGGC 
ATTCACACAATCCCTGACAAGACCTTCTCAGATTTGCAGGCCTTGCAGGTCTTAAAAATGAGCTATAAT 
AAAGTCCGAAAACTTCAGAAAGATACTTTTTATGGCCTCAGGAGCTTGACACGATTGCACATGGACCAC 
AACAATATTGAGTTTATAAACCCAGAGGTTTTTTATGGGCTCAACTTTCTCCGCCTGGTGCACTTGGAA 
GGAAATCAGCTCACTAAGCTCCACCCAGATACATTTGTCTCTTTGAGCTACCTCCAGATATTTAAAATC 
TCTTTCATTAAGTTCCTATACTTGTCTGATAACTTCCTGACCTCCCTCCCTCAAGAGATGGTCTCCTAT 
ATGCCTGACCTAGACAGCCTTTACCTGCATGGAAACCCATGGACCTGTGATTGCCATTTAAAGTGGTTG 
TCTGACTGGATACAGCCAGATGTAATAAAATGCAAAAAAGATAGAAGTCCCTCTAGTGCTCAGCAGTGT 
CCACTTTGCATGAACCCTAGGACTTCTAAAGGCAAGCCGTTAGCTATGGTCTCAGCTGCAGCTTTCCAG 
TGTGCCAAGCCAACCATTGACTCATCCCTGAAATCAAAGAGCCTGACTATTCTGGAAGACAGTAGTTCT 
GCTTTCATCTCTCCCCAAGGTTTCATGGCACCCTTTGGCTCCCTCACTTTGAATATGACAGATCAGTCT 
GGAAATGAAGCTAACATGGTCTGCAGTATTCAAAAGCCCTCAAGGACATCACCCATTGCATTCACTGAA 
GAAAATGACTACATCGTGCTAAATACTTCATTTTCAACATTTTTGGTGTGCAACATAGATTACGGTCAC 
ATTCAGCCAGTGTGGCAAATTTTGGCTTTGTACAGTGATTCTCCTCTGATACTAGAAAGGAGCCACTTG 
CTTAGTGAAACACCGCAGCTCTATTACAAATATAAACAGGTGGCTCCTAAGCCTGAAGACATTTTTACC 
AACATAGAGGCAGATCTCAGAGCAGATCCCTCTTGGTTAATGCAAGACCAAATTTCCTTGCAGCTGAAC 
AGAACTGCCACCACATTCAGTACATTACAGATCCAGTACTCCAGTGATGCTCAAATCACTTTACCAAGA 
GCAGAGATGAGGCCAGTGAAACACAAATGGACTATGATTTCAAGGGATAACAATACTAAGCTGGAACAT 
^ ACTGTCTTGGTAGGTGGAACCGTTGGCCTGAACTGCCCAGGCCAAGGAGACCCCACCCCACACGTGGAT 
^} TGGCTTCTAGCTGATGGAAGTAAAGTGAGAGCCCCTTATGTCAGTGAGGATGGACGGATCCTAATAGAC 
v3 AAAAGTGGAAAATTGGAACTCCAGATGGCTGATAGTTTTGACACAGGCGTATATCACTGTATAAGCAGC 
AATTATGATGATGCAGATATTCTCACCTATAGGATAACTGTGGTAGAACCTTTGGTCGAAGCCTATCAG 
H GAAAATGGGATTCATCACACAGTTTTCATTGGTGAAACACTTGATCTTCCATGCCATTCTACTGGTATC 
0! CCAGATGCCTCTATTAGCTGGGTTATTCCAGGAAACAATGTGCTCTATCAGTCATCAAGAGACAAGAAA 
^| GTTCTAAACAATGGCACATTAAGAATATTACAGGTCACCCCGAAAGACCAAGGTTATTATCGCTGTGTG 
?*j GCAGCCAACCCATCAGGGGTTGATTTTTTGATTTTCCAAGTTTCAGTCAAGATGAAAGGACAAAGGCCC 
TTGGAGCATGATGGAGAAACAGAGGGATCTGGACTTGATGAGTCCAATCCTATTGCTCATCTTAAGGAG 
f : CCACCAGGTGCACAACTCCGTACATCTGCTCTGATGGAGGCTGAGGTTGGAAAACACACCTCAAGCACA 
' AGTAAGAGGCACAACTATCGGGAATTAACACTCCAGCGACGTGGAGATTCAACACATCGACGTTTTAGG 
GAGAATAGGAGGCATTTCCCTCCCTCTGCTAGGAGAATTGACCCACAACATTGGGCGGCACTGTTGGAG 
tl AAAGCTAAAAAGAATGCTATGCCAGACAAGCGAGAAAATACCACAGTGAGCCCACCCCCAGTGGTCACC 
|f! CAACTCCCAAACATACCTGGTGAAGAAGACGATTCCTCAGGCATGCTCGCTCTACATGAGGAATTTATG 
p GTCCCGGCCACTAAAGCTTTGAACCTTCCAGCAAGGACAGTGACTGCTGACTCCAGAACAATATCTGAT 
Joi AGTCCTATGACAAACATAAATTATGGCACAGAATTCTCTCCTGTTGTGAATTCACAAATACTACCACCT 
GAAGAACCCACAGATTTCAAACTGTCTACTGCTATTAAAACTACAGCCATGTCAAAGAATATAAACCCA 
ACCATGTCAAGCCAAATACAAGGCACAACCAATCAACATTCATCCACTGTCTTTCCACTGCTACTTGGA 
GCAACTGAATTTCAGGACTCTGACCAGATGGGAAGAGGAAGAGAGCATTTCCAAAGTAGACCCCCAATA 
ACAGTAAGGACTATGATCAAAGATGTCAATGTCAAAATGCTTAGTAGCACCACCAACAAACTATTATTA 
GAGTCAGTAAATACCACAAATAGTCATCAGACATCTGTAAGAGAAGTGAGTGAACCCAGGCACAATCAC 
TTCTATTCTCACACTACTCAAATACTTAGCACCTCCACGTTCCCTTCAGATCCACACACAGCTGCTCAT 
TCTCAGTTTCCGATCCCTAGAAATAGTACAGTTAACATCCCGCTGTTCAGACGCTTTGGGAGGCAGAGG 
AAAATTGGCGGAAGGGGGCGGATTATCAGCCCATATAGAACTCCAGTTCTGCGACGGCATAGATACAGC 
ATTTTCAGGTCAACAACCAGAGGTTCTTCTGAAAAAAGCACTACTGCATTCTCAGCCACAGTGCTCAAT 
GTGACATGTCTGTCCTGTCTTCCCAGGGAGAGGCTCACCACTGCCACAGCAGCATTGTCTTTTCCAAGT 
GCTGCTCCCATCACCTTCCCCAAAGCTGACATTGCTAGAGTCCCATCAGAAGAGTCTACAACTCTAGTC 
CAGAATCCACTATTACTACTTGAGAACAAACCCAGTGTAGAGAAAACAACACCCACAATAAAATATTTC 
AGGACTGAAATTTCCCAAGTGACTCCAACTGGTGCAGTCATGACATATGCTCCAACATCCATACCCATG 
GAAAAAACTCACAAAGTAAACGCCAGTTACCCACGTGTGTCTAGCACCAATGAAGCTAAAAGAGATTCA 
GTGATTACATCGTCACTTTCAGGTGCTATCACCAAGCCACCAATGACTATTATAGCCATTACAAGGTTT 
TCAAGAAGGAAAATTCCCTGGCAACAGAACTTTGTAAATAACCATAACCCAAAAGGCAGATTAAGGAAT 
CAACATAAAGTTAGTTTACAAAAAAGCACAGCTGTGATGCTTCCTAAAACATCTCCTGCTTTACCACAG 
AGACAAAGTTCCCCTTTCCATTTCACCACACTTTCAACAAGTGTGATGCAAATTCCATCTAATACCTTG 
ACTACCGCTCACCACACTACGACCAAAACACACAATCCTGGAAGTCTTCCAACAAAGAAGGAGCTTCCC 
TTCCCACCCCTTAACCCTATGCTTCCTAGTATTATAAGCAAAGACTCAAGTACAAAAAGCATCATATCA 
ACGCAAACAGCAATACCAGCAACAACTCCTACCTTCCCTGCATCTGTCATCACTTATGAAACCCAAACA 
GAGAGATCTAGAGCACAAACAATACAAAGAGAACAGGAGCCTCAAAAGAAGAACAGGACTGACCCAAAC 
ATCTCTCCAGACCAGAGTTCTGGCTTCACTACACCCACTGCTATGACACCTCCTGCTCTGGCATTCACT 




CATTCCCCACCAGASffiCACAACTGGGATTTCAAGCACAATCAGTTTTCTTTCAAGAACTCTTAATCTG 
ACAGATGTGATTGAAGAACTAGCCCAAGCAAGTACTCAGACTTTGAAGAGCACAATTGCTTCTGAAACA 
ACTTTGTCCAGCAAATCACACCAGAGTACCACAACTAGGAAAGCATCATTAGACACTCCCATACCACCA 
TTCTTGAGCAGCAGTGCTACTCTAATGCCAGTTCCCATCTCCCCTCCCTTTACTCAGAGAGCAGTTACT 
GACACACGTGGCGACTCCCATTTCCGGCTTATGACAAATACAGTGGTCAAGCTGCACGAATCCTCAAGG 
CACAATCTCCAAATGCCAAGTTCACAATTGGAACCACTCACTTCATCTACCTCTAATCTGTTACATTCT 
ACTCCCATGCCAGCACTAACAACAGTTAAATCACAGAATTCCAAATTAACTCCATCTCCCTGGGCAGAA 
TACCAATTTTGGCACAAACCATACTCAGACATTGCTGAAAAAGGCAAAAAGCCAGAAGTAAGCATGTTG 
GCTACTACAGGCCTGTCCGAGGCCACCACTCTTGTTTCAGATTGGGATGGACAGAAGAACACAAAGAAG 
AGTGACTTTGATAAGAAACCAGTTCAAGAAGCAACAACTTCCAAACTCCTTCCCTTTGACTCTTTGTCT 
AGGTATATATTTGAAAAGCCCAGGATAGTTGGAGGAAAAGCTGCAAGTTTTACTATTCCAGCTAACTCA 
GATGCCTTTCTTCCCTGTGAAGCTGTTGGAAATCCCCTGCCCACCATTCATTGGACCAGAGTTTCAGGA 
CTTGATTTATCTAGAGGAAACCAGAATAGCAGGGTCCAGGTTCTCCCCAATGGTACCCTGTCCATCCAG 
AGGGTGGAAATTCAGGACCGCGGACAGTACTTGTGTTCCGCATCCAATCTGTTTGGCACAGACCACCTT 
CATGTCAC CTTGTCTGTGGTTTCCTATC CTC C CAGGATCCTGGAGAGACGTACCAAAGAGATCAC AGTT 
CATTCCGGAAGCACTGTGGAACTGAAGTGCAGAGCAGAAGGTAGGCCAAGCCCTACAGTTACCTGGATT 
CTTGCAAACCAAACAGTTGTCTCAGAATCATCCCAGGGAAGTAGGCAGGCTGTGGTGACGGTTGACGGA 
ACATTGGTCCTCCACAATCTCAGTATTTATGACCGTGGCTTTTACAAATGTGTGGCCAGCAACCCAGGT 
GGCCAGGATTCACTGCTGGTTAAAATACAAGTCATTGCAGCACCACCTGTTATTCTAGAGCAAAGGAGG 
CAAGTCATTGTAGGCACTTGGGGTGAAAGTTTAAAACTGCCCTGTACTGCAAAAGGAACTCCTCAGCCC 
AGCGTTTACTGGGTCCTCTCTGATGGCACTGAAGTGAAACCATTACAGTTTACCAATTCCAAGTTGTTC 
TTATTTTCAAATGGGACTTTGTATATAAGAAACCTAGCCTCTTCAGACAGGGGCACTTATGAATGCATT 
GCTACCAGTTCCACTGGTTCGGAGCGAAGAGTAGTAATGCTTACAATGGAAGAGCGAGTGACCAGCCCC 
AGGATAGAAGCTGCATCCCAGAAAAGGACTGAAGTGAATTTTGGGGACAAATTACTACTGAACTGCTCA 
GCCACTGGGGAGCCCAAACCCCAAATAATGTGGAGGTTACCATCCAAGGCTGTGGTCGACCAGTGGAGC 
TGGATCCACGTCTACCCTAATGGATCCCTGTTTATTGGATCAGTAACAGAAAAAGACAGTGGTGTCTAC 
TTGTGTGTGGCAAGAAACAAAATGGGGGATGATCTGATACTGATGCATGTTAGCCTAAGACTGAAACCT 
GCCAAAATTGACCACAAGCAGTATTTTAGAAAGCAAGTGCTCCATGGGAAAGATTTCCAAGTAGATTGC 
AAAGCTTCCGGCTCCCCAGTGCCAGAGATATCTTGGAGTTTGCCTGATGGAACCATGATCAACAATGCA 
ATGCAAGCCGATGACAGTGGCCACAGGACTAGGAGATATACCCTTTTCAACAATGGAACTTTATACTTC 
AACAAAGTTGGGGTAGCGGAGGAAGGAGATTATACTTGCTATGCCCAGAACACCCTAGGGAAAGATGAA 
ATGAAGGTCCACTTAACAGTTATAACAGCTGCTCCCCGGATAAGGCAGAGTAACAAAACCAACAAGAGA 
ATCAAAGCTGGAGACACAGCTGTCCTTGACTGTGAGGTCACTGGGGATCCCAAACCAAAAATATTTTGG 
TTGCTGCCTTCCAATGACATGATTTCCTTCTCCATTGATAGGTACACATTTCATGCCAATGGGTCTTTG 
ACCATCAACAAAGTGAAACTGCTCGATTCTGGAGAGTACGTATGTGTAGCCCGAAATCCCAGTGGGGAT 
GACACCAAAATGTACAAACTGGATGTGGTCTCTAAACCTCCATTAATCAATGGTCTGTATACAAACAGA 
ACTGTTATTAAAGCCACAGCTGTGAGACATTCCAAAAAACACTTTGACTGCAGAGCTGAAGGGACACCA 
TCTCCTGAAGTCATGTGGATCATGCCAGACAATATTTTCCTCACAGCCCCATACTATGGAAGCAGAATC 
ACAGTCCATAAAAATGGAACCTTGGAAATTAGGAATGTGAGGCTTTCAGATTCAGCCGACTTTATCTGT 
GTGGCCCGAAATGAAGGTGGAGAGAGCGTGTTGGTAGTACAGTTAGAAGTACTGGAAATGCTGAGAAGA 
CCGACATTTAGAAATCCATTTAATGAAAAAATAGTTGCCCAGCTGGGAAAGTCCACAGCATTGAATTGC 
TCTGTTGATGGTAACCCACCACCTGAAATAATCTGGATTTTACCAAATGGCACACGATTTTCCAATGGA 
CCACAAAGTTATCAGTATCTGATAGCAAGCAATGGTTCTTTTATCATTTCTAAAACAACTCGGGAGGAT 
GCAGGAAAATATCGCTGTGCAGCTAGGAATAAAGTTGGCTATATTGAGAAATTAGTCATATTAGAAATT 
GGCCAGAAGCCAGTTATTCTTACCTATGCACCAGGGACAGTAAAAGGCATCAGTGGAGAATCTCTATCA 
CTGCATTGTGTGTCTGATGGAATCCCTAAGCCAAATATCAAATGGACTATGCCAAGTGGTTATGTAGTA 
GACAGGCCTCAAATTAATGGGAAATACATATTGCATGACAATGGCACCTTAGTCATTAAAGAAGCAACA 
GCTTATGACAGAGGAAACTATATCTGTAAGGCTCAAAATAGTGTTGGTCATACACTGATTACTGTTCCA 
GTAATGATTGTAGCCTACCCTCGCCGAATTACAAATCGTCCACCCAGGAGTATTGTCACCAGGACAGGG 
GCAGCCTTTCAGCTCCACTGTGTGGCCTTGGGAGTTCCCAAGCCAGAAATCACATGGGAGATGCCTGAC 
CACTCCCTTCTCTCAACGGCAAGTAAAGAGAGGACACATGGAAGTGAGCAGCTTCACTTACAAGGTACC 
CTAGTCATTCAGAATCCCCAAACCTCCGATTCTGGGATATACAAATGCACAGCAAAGAACCCACTTGGT 
AGTGATTATGCAGCAACGTATATTCAAGTAATCTGA 
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hunS&_5+ 3_co rrected 
vxms cDNA 5 



A*CC^tt»Aira3<W!TrACA^ 



w 

D 



ratjsDNA 

hunan_5 + 3_co rrected 
WXS cDNA 5 



*at_CDNA 

fauman w 5+3_oorrect*d 



rat_CDNA 

huaan_5+3 corrected 
aus cDNA 5 



CTTCAOCVACAGAGACARAGr^^ 



CCTG<MCTCTTCC3U*CAAA^ 



**t_cDNA 

hi2nSftJ5+3_oorroatad 



rat_cDNA 

human_5+3_oorreot©d 
cms cDNA 5 



GACAAGTCCTRCTGCIIACTG^^ 

AACAAcrccraccrrccR^^ 



r&tjcDMA 

huiaan_5+3_porxeoted 
BU$ cDHA 5 



A<»Af3CAAAAaACCAAA^ 
ASOUSWtCAmcaAAl^^ 



rat_cDSA 

humaa_5+3_oorrected 



CCXX^GGCAQGTTTCIW^rATAGT^ 
KXaOftltXAGASTTCTOQCTTC^ 



ratjoDNA 

huznan_5+3_correct«*i 
mas cOKA 5 



NOCAffrcwaxarrco^ 



ttttjCENA 
mu$ cDNA 5 



TTCAAGAACtCTMTCTGAC^^ 



r*tjbDNA 

ht»aan._5«f 3_corrected 
nms cDNA 5 



TTT<WGARAmCAACaOTTTO(»AAC» 



ratjcBNA 

human_5f3 corrected 
sua cDNA 5 



AGTC&&AAGAGCCTCA GfcCAC — -ACGftCGftCXSkCfCC^^ 

AACS&GG&AftQCAXCATTAG&CAiC^ 



kuman_5+3_ci6rrfccted 
mus cDNA 5 



)«Sii.5+3j corrected 



ratjsCNA 

hunLan^5+3_oorreeted 
mis cDNA 5 



rat_ctMA 

bunra_5+3_corrftat*d 
isu8 cDNA 5 



ratjsDNA 

l»iflaa_5+3jeorreoted 
srus cONA $ 



mas cDNA 5 



h\2&an_5+3__corrected 



HXffTCC3VGW3G(X»GC^^ 



ratjottCV 

JttatW_5t3_eorreotftd 
SUA oDNA 5 



h\Bsaan_5+3^oorrect^ 
SO0 cDNA 5 



ratjoCNA 

hu«Sft_6+3_oorrectedi 



CTOT(X3tfXZTAACTC^^ 



rat_cDNA 

frumgn_5*3 corrected 
JRU5 cDRA 5 



TCATCCACTG&^ 



huaianj5+3_ oorreoted 
XBU3 ctt*A 5 



OOTTCCAOrrOCTTCCC^^ 



rat_cDHA 

human_S+3_oorr«cted 



human_5 + 3_corrt*cted 



rat_cDNA 

buxwm_5+3joorr«cted 

W3B <5fik 5 



0O3G&K3CACrGrG<&A^^ 



rat^cDNft 



rat_ctfNA 

hOT»n_5+ 3_correc ted 
AUd cDNA 5 



hu»apJ5+3_oorrected 
mus cBNA 5 



M!TOTOTGGCC3V3CAACCC&i9GT<^^ 



Pi 



huiiSn_5+3_corrocted 
bus CDHA 5, 



sn 



htmaa_.5+ 3_corr ected 
xaus cDNA 5 



3 



nus cDKA. 5 



en 
a 



rat^cDNk 

bucSn_S+3_corrected 
mas cDKA 5 



rat_CDNA 

human_5+3 corrected 
ecus cDNA ? 



rat_cDHA 

feun^_$+3_corr$oted 
1018 cDNA 5 



rat_ODRA 

hoaanJ5+3jaorrected 
ttH8 cDNA. 5 



ratjoDNA 

hnn5n._5+3_ooi^:«ctcd 
mils cDNA 5 



OTUUaGCTGTCKrCGA^ 



xatjcDSA 

huaSa_5+3_correotod 



rat_cON& 

hujaan_3+3_correct^d 
mud c&cv 5 



rat_cDNA 

hataaxxJi+Zjeo treated 



rat_ct)N& 

huswjj&+3_corr*oted 
stud cDHA 5 



rat^cDKA 

hwman_5+ 3_oo rrected 



rat_cWlA 

humanJ5+3_correct«d 
stud CDMA 5 



r*t_cDHA 

fcuraan_S+3_ corrected . 



hua^i^S+Sjco rrected 
tuna c&A 5 



bun£m_5+3_correct*d 
bus CONA 5 



CTGTCCTTGACTGCG^^ 



rat_cW3fc 

burwn_5+ 3_co rrected 



OCRATQ^TQmTCCrrTCTO^ 



rat_ODNk 

femsari_5-*"3^corrected 
urns cDKk 5 



r&tjcDNA 

human_5+ 3_corrected 
mus cDNft 5 



human_5+3 corrected 



huffSti w 5+3joorr«cted 



IlX^CTGC&GAtXlASATGGQftXCCCA^ 



ratjcnNA 

hun£n__5+3 — corrected 
jnue dm 5 



hunM_5+3_corrected 
xnus cDNA 5 



htan5n_5+3jcorreot«<l 

ZQUS CDNA 5 



<7TG<^GAGAGCGTGTTG<3^^ 



rat_c0tSA 

hu»an_5+3_C0Mreoted 
jmis cDNA 5 



TCftGAAAaXATTCftfcC^^ 
TTAGAAATOOmSWGWtt^^ 



r*t_ct>NA 

hij»aaJ$+3joorrected 
mas cDHA 5 



GCTCTGTTGATGCrrMCOCA^^ 



r&tjjDNA 

hiocan_5+3_cjorx«oto<l 



TTGCaflACAGAOC&CAC^^ 
TTra2*A*GGft0C3OAAGT^^ 



rat_cDNA 

h\j»an_5 +3_co r r ectod 
me odka 5 



rat_cPNA 

huroan_5+3_corrected 
mud cUKX 5 



AC&TCW<3*aftCTCATC^ 
ATATTGAGAAAXT&jTCATATTAGAAATO^^ 



rat_c0NA 

h\Eftap_5+3_corrcctfed 
cDNA 5 



rat_cDHA 

hw*n_$+3_eorrected 
imm o5ja 5 



rat_cW<A 

huaan_5+3^oorrcctod 
ttUe cDRA 5 



T0<»W3GAMAT&CATACT(^^ 
TTAAMGGAAAJACMATTCC^ 



r*t_cDKA 

httt5jQ_5+3_oorrected 
nus cDNA 5 



ACGACCAAfiGAMTTA^TCI^^ 
AXGAC&3AGG2W\CTA!I^TCTC1^^ 



rat_CDNA 
mu5 cDNA 5 



ratjbENA 

human_5+3^oorroct*4 
mm cDNA 5 



TO^QGflJSCSvCAflC^GAAGC^ 



rat__ODMA 

ht3o5n_5+3_oorrected 
mus cDNA 5 



rat_cOKA 

hua5aJ5+3jcorrect«d 
anxs cDMA 5 



AtWSAAGTCflGAlOCTT^ 
A3X3GftAGTGRlSCAS^^ 



xatjootR 

Juswlii_5* 3__corrected 

TO13JCDHAJ5 



ATTOCTCSAGTCTATAAm^ 



*at w otMA 

fcyaafl_5+3_oorr«cted 
sua cDNA 5 



A32OTCAAGTAAtCTi3AGAXGAA AI7AIAAAGT---CAACAA CATCIGGQCA 



rat_cCNA 

buaan^S+S^corrected 
skua oUNA 5 . 



GGGTTTA!TT1 I WTGGAA<3&^^ 
GAAM!ATTTITTG$AAGAAGTTtAA!rGft^ 



rat_cENA 

human_5+3 corrected 
SOU CDNA 5 



ATJUCATTTACWSr&Sri&M^ GACXTOIAAATGftftA 

A33VCMTTACAGTATl!AAA2TIftC^ 



ratjcOKA 

lwaw«i^5+3_correct6<S 
VU9 CDNA 5 



GCATTCTGAftCTGAA ACOGAOTCTCTG — TGGft3WTCAAAflCRAftCTCTXfiAC3^^ 

GCATTATGAACTGAIG^TACTGATro^ 



r*t_cDNA 

bnman_5+3_corrected 



GGGMypmXETtTQOC^^ ACATTGAAACGGTTCACTAT 



rat_ot>SA 

hus5n_5+3_c6rrected 
inu* cDNA I 



GAAATftACZtfACGGCTAAT 
AAAftIAACWATGGaraKK3^^ 



rat^cDNA 

humaja^S+S^oorractad 
mua c5na 5 



TACCAAITGTCACAAACAttXX^^ 



rat_eDNA 

hman_5+3_eorracted 
nxis cDNA 5 



AITGG&OTC&ACGGGAAAJ^ 

A^QAGA(3lft!I<jQAAAATATTAAATOTGC — AATCT~A2OTATAftAtATTT 



ratjcONA 

bVBS*n_5+3_ccrreoted 
bus cDNA 5 



ra.t_cDNA 

huraan_5+3 corrected 
2BOS d>NA 5 



^lAAACArm-GATAAAAO^WgA^^ 
TOT (S<OTTAIAAATTTTTTTGCTAAAACCTAC^ 



AAAA 



(rat_cDNA: SEQ ID NO:7) 
(human_5+3 corrected: SEQIDNO:8) 
(mus_cDNA_5: SEQ ID NO:9) 



Figure 15 



rat 

huaan_5+ 3_corr ectfd 
nouse 5 corrected 



rat 

hunaa_5 f 3_eorrec ted 
noose 5 corrected 



MKLGYNSLVRIllETDFSGIJTCLELI/lIiJSHGIHT IKJKTTSDLQALQVLKKSXNKTBICL 



rait 

huinan_5+3_corrected 
xBoofie^S corrected 



m 



a 



3 S 

en 



r*t 

hu»an_5+3__corrected 
«ouse"5 corrected 



rat 

lmnan_5+3_corrected 
mouse" S corrected 



rat 

Jnsnaa_6*3_coreected 
ttouse 5 corrected 



rat 

bufflan_5+3_oorrected 
stoude 5 corrected 



r*t 

huaaa_3+3_corrected 
*auae 5 corrected 



r»t 

hiaaan_5+3_oorxected 
ramee S corrected 



rat 

wousa 5 corrected 



ret 

taaaa_$+3joorreoted 
aouae 5 corrected 



rat 

hun^J5+3j»rrected 
moused corrected 



rat 

feuu»n_5+3joorrected 
»ouse""5 corrected 



rat 

huroaxi_5+3_corrected 
moused corrected 



PLRTPLVELTDEBKORS^^ 
FVVTQI#WI?«EEt>DSSafclIAI^ 



rat E^-TVHPQTLQSEHI£D£raFSVTN^^^ SV 

hun^_5+3j»2XeCted EFS PVVNSOn«H 1, Bi£PTDFI^STAlI^T I AMSKNINPXMS ^IQGTWCJHSSTVFPLLLGA 

»ou0e_5__corrected — — — — — - — , 

rat ARIRDS&QMatAS-~S^ 

hlgna n_5f 3_correo ted TEFQDSDCHGK^tfiaFB^rlTVRSMIK^^ — LLESVNTTNSH — QT 

3noufl$T5^correot«d ™ - — — 



rat PITGVSRPSSSDISSETIADPSFSSHPSGSHTT^ 

huaan_5+3_oorrected SVREVSEPRHNHmHTTaXI^STFPSDPH^^ 

inouse^^corcectad — — — ■ . — 

rat IWSRGRVKNFHRt^VIMHRHR^^ 

hu«ail_5+3jcorxeoted IGGE/^ISPYRTPVLRBHRYSIFRSTTR&SE^ 

&ouee S corrected — — — — — — — 



rat TJu&swsssasKLBKnmtt 

jnouse35_correct«d — — 

rex NHVI*T£ASM!FSAP?SVSI^P^^ 

hUW_5+3_oorrected SC^PT<aVMTtAPTSIIMEK^^ 

Kiouae S corrected -~— — ~ 



rat TS*KFSKMCrnH&IF^ 
h\»anJ5+3_oorreoted *aTRFSftMCIPRQ#8Vl^ 

uouse 5 corrected — ^— — — - — ~ — — 



rat mTSPPPALSTTMAATCm3TEVV$<^ 
hwaan_5+3jcorreoted TIiST3VMQIPSm?£raHHTTT^ 

oouse 5 corrected — — — — * — — 



rat l£TETP^VT3PT?mi3VI^E^ 
tuman_5+3_correotad XGTQlMPftmTFPASVircETQT^^ 

mouse S corrected « — — «~«~ 



rat * 

huzwm^5+3_corrected 
mouses corrected 



TJ«TP~PMAFEHSPPENT3GI$$TISFHSR^ 



rat 

human^M^corrected 
bous«T$ corrected 



I^K$Q&m , VKRAS-DTP-PFI£$&<^ 
LSSKSBQSmBlMIOTmmSSSMlKPm 



rat mYESSSHNTDfcQQPSAE^PNmiTCOT 

kun>ajaJ5+ 3_co rrected VKLHE3SRIW — LC^SSQLE? ^TSBTOtlUJIST PMEftLTTVKSQSRSKLTPS? 

ttOtt$e_5jcorrected 



rat 

huraan_5+3_corrected 
mouse~5 corrected 



WAEYQFBHKPYSDIAEKGKKFEVSMIATTGLSEA^ 



rat ^6KHIJYVSLPimL!aCPaiIG(38M 

huaai\_5+3_corrected TTSKLLPFDSLSRYI FSKPRIVtSOCAASFTI PASSKVFLPCKAVGNPLPTIHWTRVS-^L 
nouse 5_corrected ^ 



rat EISO^KSRFHVLPNGn/SICfcVSI^ 

hunfln_5+3_eorrected DLSim^RVOVtmSn^I^ 

mouee_5_corrected — -- ^ 

rat BHVKE ITVEFGSTVEXKCRVBQG^PTVSWIIANC^ 

huaan_5+3_<x>rrected wuraiTVHS<3STVEIJC(3ttE^ 

mouse 5 corrected — — — — ■ — — 



rat LSLYI^mCVA$^fiGQ&SIJ.V^ 
human_5+3_corrected ISVnmTXKCMWXPGGb^ 

mouse 5 corrected — « j 



Wit PQP$VHWVLYIX3TELKPI£LTHS 

huaan_5+3_correoted PQPSVYWVLSDCTEVm^mSKLFLFSHOT,^^ 

nous e_5_cor rect — — — 

rat VILTVEEGOTIFRIETfcSQKll^ 

human_5+3_corrected VtoTMJ^VTSPRIEA^KKYSVNTG — - 

mouse~5_corrected = — 

rat GSRIHVYPNG^VVGSVTEKDAG^^ 

human_5+3_COrrected GSITIHVYPNGSLFIGSVTEKDSGVYIC^ dhkqyfrkq 

«ouee_5^correct*d 



rat VI£G*I>FgttXDX?lSGSPVPBV^ 

hU»an_5+3_COrreotftd VLH^DFOVDCKASGSPVTEI SWSLPIXxTNItQ^HQ^DSOHRTRRYTLFNNGTLy 

tiouae_5joorreotftd . — — — — „ w 

rat GMaEEGDYIC^AQtfT^^ 

«ouse_5_corrected ■ — — — . J — — 

rat KWTCFKIJ^SmiVISFSlfi^^ 

human_5+3_corr*cted KEKIfTOiIiP3*rc*a$mc^ 

aouae 5 corrected — — - 



rat IV$KPPLINGLXANKTVIIMAIRfl$!OT 

hunan_54-3_ccrrected VVSKmiHGLYTHRTVIK^VRHS^^ 

MOuse_5_corrected ■ — — — . 

rat VTVEFHCTLEMRNIIU^DSJ^ 

hwaan^5+3 M Oorrect*d ITVH*a*GTLEIFOTRI£D6ADFIC^^ 

aouse_5_corrected 

rat OAfiKPVJVLNCSVDGNPPraiTW^ 

hunan_5+3_COrrected QLGKSTALNCSVDGNPPPE I nTCLPNtjTftFSNGPQSYQYLIASNGSFlI 

»ou$e 5 corrected — 



buman_5+3_corr«cted RCftRRflKVGYTEKLVri.r,T C^rPVTT.TY^pQTVTTrTf?^ ^ WTVffPn T PE HN 1 KW'irTP 
fiaonse 5 oorr^ct^d * — . 



rat 

huffian_5+3_correoted 
mouse""5 corrected 



rat 

huftan_5+3jcorrect«<l 
5 corrected 



rat 

lactase 5 corrected 



LVIQNPQTSDSGiyKCTAKKPLGS DXAJVTYIQV 



w 



(rat: SEQ ID NO: 10) 
(human_5+3_corrected: SEQ ID NO: 1 1) 
(mouse_5_corrected: SEQ ID NO: 12) 



in 

u 



Figure 16 



rat 

human 5+3 corrected 



*.*.** **;*• : .:***. ***.;*********$********.********.**. *** 



rat 

human 5+3 corrected 



*********,** *„**,**************** j # ***** t **«*********** j ; 



rat 

human 5+3 corrected 



******** ** i *+*t**A*+*4<*** i t*** t ********* !**************** 



rat 

human S+3 corrected 



rat 

human 5+3 corrected 



♦ ** ****|* j**********j ******* ;*******«****««*«««*.*.« *«. 

******************** ***.*.***.«* *.************. **.. 



rat 

huaaa^5+3_oorrected 



SAST$PQI^IEPFGSLSLNHTXX3GNKM^CSIQKP 

** *** *. *****.**** ******************* *********♦**♦***# 



rat 

human 5+3 corrected 



FW^IDYC^QPVWQIL&LYSDSmLEItf 

***-*********-*************- A-**** -***** ********** 



rat 

human 5+3 corrected 



m,l^PSHI*iQDQISIiQLNRT^ 

*;**«* *. *:;* A;********!******)******.*******. • ***** ** 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



9 *********** j % * .*** | *** J** J J *********»**"************ m ******* 

ADSFDTGVYHCISSNYDD^ILrmTVVSPLVEAYQBNG 

***** •♦•*****.* ************** *♦ •;^*»«*** s ******** ***• 

pi^is*iLPGwm^pSKi^nto 

******** ;*** s *. *,***. . .*****************. *|********,** *}* 



rat 

human 5+3 corrected 



SVKMKCSQRPI^HDGET^GLDE^PlWm^ 

** : **** .*** *. ****.*,*. , :* j*.**;.*.. :::: 



rat 

human 5+3 corrected 



HRt)LIHI^GDSTIJ(RniEHW^QLPIiSARRI DPQRS2AALLRKAKKKSVPKKQENTTYKFY 
YBELTI^GDSTHRRFRE.^^ 

•«|* j****** ***** t *t <l * ********|***********. j*^*.***** ,* 



rat 

human 5+3 corrected 



PIAVPLVl^TDEEKDASG^PPi^RlV^^ 
PmQLPNIPC^DDSSGlLM^El^^ 

*:.. * ::..**.*:***:. .***** *** **** **.* ** 



tat 

human 5+3 corrected 



BVS-TVtfPQTI^EHI^FKIJSVTN^ SV 

EFSP^^ItOPPEECTDFKI^AIKTTAMS^ 

* * ** * * * **** - . **••*•**••-*•*• ***** *** 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



SVR£P/$EPRHNHFVSHTTQn^STFPSOPHtAAHSQ 

,j ^. **** * * * ***+. * * : ** 



rat 

human 5+3 corrected 



* ,***: .*;*********j .*.: % ,** * : * . * **.* 



rat 

human 5+3 corrected 



rat 

humar^5+3_oorrected 



** * ** ****,.. *j .„* ss ;* 



rat 

human 5+3 corrected 



rat 

human 5+3 corrected 



TLSTSVMQI PSNTITXAHRTTTKTH^^ ISKDSSUCS II 

** ** *.*::;tu, *:. . , **•: *: **. .*.***-.*** *.* 



rat 

human 5+3 corrected 



SXBTPT-VTSPmtAOTIHSSTQIlTRSKKAia^ 
STQTAIP&rTPTFPftSVrrTET^ 

**:*« t*j** .**+* *** **; . * • * t***.:.* :* * **i;t x 



rat 

human^5+3jcorrected 



. ,***;**♦ * +) jjjt* * a * «* . **** m * * **** 



rat 

human 5+3 corrected 



SSKSH0ST?m^DTmPFL6SSATU{mi^ 

***• ;*** % J }** *** **j***.* # ,****. . * *jj j* ** 



rat 

human 5+3 corrected 



KLHESSRHN — LOMPSSQLEP LTS STSSn&KSTO4PALT?VXSQK5KbTP£Ptf 

„ :: ****** ** * *.+.++*: **.:*>* . ***** >*** 



rat 

human 5+3 corrected 



PEHK*QLr3Y$ETim3KltfAV5^ 

,*n: *.**: ****:* *** f .*.**:*^.*.*a ** :s< ** ***** *; 4 



rat 

humanj>+3_oorreated 



rat 

hattan^S+S^oorrected 



rat 

human 5+3 corrected 



*** **j ;..**** j******»# j*>*** w ***.*****j*** ******* » • 

•***^*.** w ************* # *********** * j«*g**** 

.,***+** ********* # *# * ***;***********() }*•*• „ ** "****• ; = *W 



rat 

human 5+3 corrected 



* t ************* i ******+*****.*****.** 3 ** ^**^ + ************ 



rat 

human 5+3 oorreoted 



****.*** ****.#***♦*♦*; ;**• # ******* - ;* it * *********** ( ******* 



rat 

human 5+3 corrected 



KLTttmOTTSPRrattSQiO^TJ*^ G 

«**;** * ****J**** ****•*.**********,***.* ; * * 



rat 

human 5+3 corrected 



★ *********^ ;**»**»* j* a*****,*******.**** *** ^**** ****.*** 



rat 

human 5+3 corrects 



*******************;******** J j**^ ******;**.***** | *******.** 



41 



human 5+3 corrected 



rat 

human 543 corrected 



rat 

human 5+3 corrected 



j****** * ******************* ****** ** a ** ; ************* 

*••****+**; .«*** **|***** .*!*;*** ****j***** .******** **«*. 

VSKmiNGLYT^TVIKATAY^ 

***********j + j*******.***** ; ***** j* **** ( **#* b *** ; *;*;*; 



1=5 



rat 

human 5+3 corrected 



rat 

huaaa_5+3_oorr«cted 



*** A****.**; ******** **, *^ **************************t** • . ** 

** %i ************** ****,**.*.« *. t **;*,***,*; *j** ;- .+**•* 



rat 

human 54-3 corrected 



*************..************ A* ** t? ***_ ****************** *^ 



rat 

human 5+3 corrected 



r&t 

human 5+3 corrected 



*J*.**** : ;******,******* .ffAj*;*****.******} , 3 *j* # ** ; ******* 



rat 

human 5+3 corrected 



**** ********** *j* **-*t* ; ***** l 



(rat:SEQIDNO:13) 
(human_5+3_corrected: SEQ ID NO: 14) 



Figure 17 

MQKRGREVSCmSLTAICLVVTPGSRVCPRRCACYVPTEVHCTFRDLTSIPDGPANVER 
VNLGYNSLTRLTE^FSGI^Rl^LLMLHSNGmVSDKTFSGLQSLQVLmSYNKVQDE 
KDTLYGLRSLTRLHLDHNNIEFINPEAFYGLTLLRLVHLEGNRLTKLI1PD1TVSLSYLQIF 
KTSFKXLYLYDNFTSLPKEMVSSMPNLESLYLHGNPWTCDCHLKWLSEWMQGNP 
(SEQIDNO: 15) 

Figure 18 

MKVKGRGrrCLLVSFAVICLVATPGGKACPRRCACYMPTEVHCTFRYLTSIPDSIPPNVE 

RINLGYNSLVRlJvIETDFSGLTKLELLMLHSNGniTIPDKTFSDLQALQVLKMSYNKVRK 

LQKDTFYGIJ^SLTRLHMDHNNIEFPNPEV^ 

QIFKISFIKFLYIi>DNFLTSLPQE 

KCKKDRSPSSAQQCPLCMNPRTSKGKPIj\MVSAAAFQCAKP1TDSSIJCSKSLTILEDSSS 
AHSPQGFMAPFGSLTL^MTDQSGNEANMVCSIQKFSRTSPIAFTEENDYmNTSFSTFL 
VC^YGfflQPVWQILALYSDSPLD^RSHLI^ETPQLYYKYKQVAPKPEDIFTNlEADLR 
ADPSWIMQDQISLQL^TATTFSTLQIQYSSD^ 

I^HTVLVGGTVGLNCPGQGDPTPHVDWLLADGSKVRAPYVSEDGRILIDKSGKLELQM 
ADSFDTGVYHCISSNYDDADILTYRrrVVEPL\^AYQENGIHHTVHGETLDLPCHSTGIP 
DAS IS W VIPGNN VLYQS SRDKKVLNNGTIJIILQ VTPKDQG YYRC VA ANPSG VDFLIFQ V 
SVKMKGQRPI^HDGETEGSGLDESNPIAHLKEPPGAQLRTSAmEAEVGKHTSSTSKRH 
NYRELTLQRRGDSTHRRFRENRRHPPPSARRTO^ 

SPPPVVTQLP^GEEDDSSGMLALHEEFMVPATKALNLPARTWADSRTISDSPMTNIN 
YGTEFSPVWSQIIJ>PEEPTDFKI^TAILTTAMSKNINPTMSSQIQGTTN 
ATEFQDSDQMGRGREHFQSRPPITWTMIKDVNVKM^ 
SEPRHNHFYSHTTQIl^TSTFPSDPHTAAHSQF^ 

PYRTPVI^RHRYSIFRSTTRGSSEKSTTAFSATVLNWCI^CLPPJELTTATAAI^FPSAAPI 

TFPKADIAR WSEKTTLVQNPLLLLENKPS VEKTTPTIK YFRTEIS Q VTPTG A VMT Y APT 

SIPMEKTHKVNASYPRVSSTNEAKRDSVrrSSIJSGArrKPPMTnAITRFSRR 

NNHNPKGRIJiNQHKVSLQKSTAVML^^ 

AHHITrKTHNPGSIi>TKKELPFPPL^ 

TQTERSRAQTIQREQEPQKK^TDPMSPDQSSGFITPTAMTPPALAFTH^ 
ISmSRTLNLTDVffiFXAQASTQTLKSTIASET^ 

LMPVPISPPFTQPJVVTDTRGDSHFRIJvlTNTVVKLHESSRHNLQMPSSQI^PLTSSTS^ 
HSTPMPALTTVKSQNSKLTPSPWAEQFWHKPYSDIAEKGKKPEVSMLATTGl^EATTLV 
SDWDGQKNTKKSDFDKKPVQEATTSKLLPFDSLSRYIFEKPRIVGGKAASFTIPANSDAF 
IJPCEAVGWIJT1HWTRVSGLDI£RGN^ 

FGTDHLHVTl^VVSYPPRILERRTKEITVHSGSTVELKCRAEGRPSPTVTWILANQTVVSE 
SSQGSRQAVVTVDGTLVLHNLSIYDRGFYKCVASNPGGQDSLLVKIQVIAAPPVILEQRR 
QVIVGTWGKLKLPCTAKGTPQPSVYWVI^DGTEVKPLQFTNSKIJLFSNGTLYIRNLAS 



SDRFTYECIATSSTGSERRVVMLTMEERWSPRIEAASQKIITEVNKjDKLLLNCSATGEP 

KPQIMRLPSKAVVDQGSWIHYPNGSLFIGVTEKDSGVYLCVARNKMGDDLILMHVSLR 

LKPAKroHKQYFRKQVLHGKDFQVDCKASGSPVPEISWSLPDGTMINNAMQADDSGHR 

TRRYRIiWGTLYFhKVGVAEEGDYTCYAQNT^ 

KRIKAGDTAAVU)CEVTGDPKPKIFWIJLPSN^^ 

E Y VC V ARNPS GDDTKM YKLD V VS KPPLINGLYTNRT VKATA VRHS KKHFDCR AEGTP 

SPEVMWMPDNIFLTAPYYGSPJTVHKNGTLEIRNVRL5ADnCVAJlNEGGESVLVVQLE 

VLEMLRRPTFRNPRNPFNEKIVAQLGKSTALNCVSVDGNPPPEIIWILPNGTRFSNGPQSY 

QYLIASNGSmSKTTREDAGKYRCAARNKVGYIEKLVILEIGQKPVILTYAPGTVKGISGE 

Sl^LHCVSDGIPKPMKWTMPSGYVVDRPQINGKYILIiDNGTLVIKEATAYDRGNYTCKA 

QNSVGimJTWVMIVAYPPRITNRPPRSIVTO^ 

LLSTASKERTHGSEQLHLQGTLVIQNPQTSDSGIYKCTAKNPLGSDYAATYIQVI (SEQ ID 
NO: 16) 



Figure 19 



Leucbi£*rich repeat 
Signal Peptide 



NLS 



IgG repeat 



Figure 20 



B 



C3T3 



Cell lines A primary cells 



C6 

C2C12 
STO 

Bone marrow 
PC-3 
SAOM 
U20S 



R«< glioma 

Myoblasts 

Fibroblasts 

Stromal oefls 

prostatic carcinoma 

otteosarcomji 

osteosarcoma 



Articular cartilage chondrocytes 

fcOS osteosarcoma . 

Cafv*ria Oftfeobiafts 

C3H1W1/2 mesoocfcyme 

MCTO-E1 pr*o*ieoblas& 



<+) 



(+) 



Figure 21 




Figure 24 



Bone Marrow QxWar la 




GAPbH 
608 



tX*ys 5 5 15 15 



Figure 25 





















<- 606 



control _ Blood Biropen sciatic 



V»3 

Si 

iii 



Figure 27 




B 





Figure 30 



111 




Figure 31 




Figure 33 



S • 



•Vi •:• 



;-«t: 



...it:.. -> ...i-Tiy ■ ' '• • • 



Figure 34 



Protein 
{wimble 



Catvarid 



pcDNA 



608 



CcJvaria ♦ 608ApcDNA 




Figure 35 



Figure 38 




Figure 39 



-b»S ' • .... 




Figure 43 




(-3*3) 



t 



Xbtf 
1010 
(-2651) 



Figure 45 



mi 
0) 

T 



42KB 
(542) 



6SS9 67S5 
(2m) &»4) 



— t——i—i 

1 ' T 1*ATG 

BinHI > — 



Bxuffl 
5098 
(1437) 



JgL «» 11610 
(44ai> (4557) (7625) #019) 11967 



^ > <w 



(I) 



OoaeMClO/lSEn 
« 



Figure 46 



%rAl 
Bco47DI 



Pmel 




A«j>72« 
Acc65I 
Ncol 
Mlul 
Bgln 
Apal 



Figure 47 



JCI&g£AOTAAAAIX3ATCCTOAOTA 
AATCATCTCTCTGTGGTTTGC^ 

AOCTACCXjAATACAGAGCTATGACXnTAQTCTGGAAOOAQCTTrGQAA 

GTAGCIXKXX:AGAACATACTAQATGCCATATrrCCM 

CTTAAOAATATGGTTGTCTCrKKlAGTQCT 

GGTTAGTGCTAACTCIKIACTOCTTGAC^ 

ATCACTroOTOATTAOCAACTGGATATGGAATATTA 

CrCAQCTTTGOC^OCTCCATTCATTCTO 

ACTOTTTCrACTTACAAGAOX^mCCTO 

GAAGAGTCAATTAGAC^TATTCTGATGMOTAAGCATATAAAG 

TGTTCCATGTATGCTCATGGATGCTATTATAATGTXK3AAATAAA 

AGCTTATGATACTTGTCAC^GAGTAAATCTTCCATAAATAT^ 

TTTCATAATCCATCAATTAAAAACCTTTAGA 

CTCKXXTA<K3ATCKJTCTC3TATG<nXKK^^ 

AGGAGCACATCTTCACUACAKriXKXn^ 

ACTCAC^lxmJCTTCAAC^^ 

GGAAGCCTGCTAGACK3AACAGGATGTATCAGGATAAC^^ 

ATAACCCTC&CCACAACAAACAACAACGACAACAAA 

GAACTAAGOCATATGCAGAAAAAGTAm^ 

AGTCAAATATG^GAATATAGACAAAGATGGTTTATGCA^ 

CmTAGATTAGAGGACACAAAGCMCMCTACAGAGTGAAGTAATCCAGAGAOT 

TATAATATGAGGACTGTATTTAATAATTCTATTTAAGATACACAGCA 

ACTAACACACACACmCATAGAGAGAATAAAGT^^ 

TAGCIXJATAATnt^TATTGTACACCTC^AACATAGATAAOCAACAAAG 

AGGTGCCTCTCCC^GGaC^AAGAGTACA 

CC^CATCCAGGAGCTAGAGAAAGTaGCC^ 

TGGAACAA(^TTATGAGCTAACCAGTA(XCCGGAGCTC^ 

AAAAGATGGCCTAATCGG<XATCaC^^ 

ATGCCCCAGTACAGGGGAATATOAGGGCCAAAAAGGGG^ 

G0GGTGGGTCK3ATATGGOGGACITITGGTATA^ 

CTAATAAAAAATOGAAAAAAAAAAAAAAAAAAAAAAAAAGO 

TTCACTOCTATCTCAACTTCK^^ 

GATTGAAGCGTCTTTGAATGT^ATIXKro 

TGTGAAAACTTACGOKWTCTATGACATOT^^ 

ATATTTOGCACCACTAO^QCCCCAGCCCCC^^ 

CACACACACACACACACACACACACAC^^ 

C^TCATCTGCTCACTCAC^ 

CTATTTTTATGTCTAGGTTCaXXTr^ 

AGAATTTOTAGCCCCTGAGCCAGCCCGGGCCACA^ 

CAGOGCTTACCAGCACAGGAGGAGCAAACTCGCAGGCGAGC^^ 

CXXXKKnWATGGCCCGCCCATTOXA 

axrrCCIXXCGAGTGAAGACX:CCTAGAG 

TTGGCCCAAGGCAGAGGTG^GCGGCTGCrAATCCCXnCGGG 

AGTCTGGTAGATGTGGGAGCTtXXXlTTCAGGGOGAG^ 

GGTCCAGGAAAAGGAAAQTACTGGAGGGGAGTTQQQACAAAA 

C^TCQCTTC&GTQACTO^ 

CXK^TTTCATAAAC^CTG^^ 

G<K>ACQTCCK3AAAGAAGGAGTGATCQATTAGTACTTQTAAQTrrATO 

TAAJHMCCTATACrATTQ^^ 

OCAATCAGATAAAQGAAAGCTTlXK^TTAGriTGGAAATGT^ 

Q MQCq( X7K^AqTTCTC^^ 



TATTGCCAACTCTrnXJTTAAA^ 

TTt^TXTAOTOCrciXTTCT^ 

CAOCTACCTTOTA(XAIxnTrGTITAT^ 

ACGCAOAGAOTATTCAAAAATATTCAGTOATC^AACAGTGACACTGT^ 

ATCATTAGTAAArCTTAATTTOGGTTAAACrc 

CACHX3AG<XTTCGCCACGTGCGGGTTiUAGATA 

C1TGG<XATGCTCCCX^TCACTO 

AQCAOACAQTOTTATGOCrGCAAAGT^GAGAC^ 

GTCCTCCCGGTCATTAGTTGGCTCTCL\AAAGtt 

AGACATGCTAGAAAGAAATTTGACXnTrTTTTCT 

TTTGGAAGGrTAAGCCAAATAAATAAAACACATATTAAATAATACAATG^ 
GATCATATAAAGAAGTAC^TTCATAAATGCAATGTGAAAAATAT^ 
TTACTGGIXK1AAAGTTTTCTAAATTCK!^CATGTACT 
AAAATGTATATAAAAGTGTAAAAGGCTCTTGGTCAAACAAGAGAG 



TTAATTGTCCCGATAACATTAITATGATCI^ 
AATGAGAAGCTATGAAGATATGTTTACAATAATAAGCCC^TTTAGTO 
GGAAGCTAGCACACACTGGTTrATAAAGAGAACAGTTTCCTGAGTCT 
CTCTAGGGAATAAGAGTTCCTCTTTCTXXAGAT^ 
TTCTTGATGATGAGCAmTMGTGGAATA^ 

ATGCCCTQAACAATCrrTCAOGTC'l 1 1CT1 1C^QTTCACTAGTCTATTCATTTATIXK3ATAA 

TTGGGGGATGGTGTTAATTTTTTTGCAGTTCTTA 

AAACAAACAAAAAA(XTCTGAAACTAGAACTACCAATC^ 

AGAAATCTOCACAGAATTTATTOCTACAT^^ 

A(XAACTTACGTAGCCGTOLAAATATGAACGGATAAAGAA 

AGAGTCCCATGTGGCCATAAAAGAGTGAAA^^ 

GG AAATCAATTGGGCTAATCAAAACAAGACAGACTCAAAAAGG 

CTCroACAAAC^GAAGCTAGATTTA^ 

TAGTTATACACTATTCTMTCTGTGAGTGTOTAT^ 

AGOTAGCTGGGGTGGGTAGGAGAGAAAGCAATGAGAGGAGTTAATAAGAACG^ 

OTMCATACWTGCCACKJATOAAATGCATTAATr^ 

GGCACA(XfTTCAAA(XAGGGTGAAATCCCAGCACAGAG 

TTCGCCACTAACCAAGAAGCCATTTC}^ 

CTCXXAGTCTGA^CTGTGTATAACA^ 

rTTTraTGCTATTITIX^TTX^ 

GCTGTGGTTrTCATTTGTTTCTITIXjA^ 

^GCTGGAAACGATCTGGAAGAAGTTGGGGAAAGAGAAAAATT^ 

ACAAACAAACAAACAAAGAAAAGGTTCATTTrGd^ 

:AGTTACGACItrTTAAAGAAAATATTCCC^ 

^AGGACTTTGCTTGAACTGGCCCT - Artificial 




Notlrite 



(SEQ IDNO:17) 
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CA(^GACCTTCCTCIT^ 

CAGCAACK^CACTCTGCTMTGCAACKK^ 

Q<^GACCCACAC(nTTCCAACTXKT^ 

GAAACCTKXrrGTGATCTXEAm 

TTCXrrcAAAOCTTTCCTCrcCAGCAAAC^ 

TCATIX5GGAACXXn^TGCTCCAT0CAAT^ 

CTTGGTCAGTGCAGTCTITIOTATC^ 

AACTGTTITITGTTATOATTGCTGC^ 

TCCTTTOTCTOCTCCTT^ 

ACCAAAGTGATATAGATTCATGGCATTCCTCTAACTGCT 

ACTTGGCAGGTGTAAMGCCraOAAGCAGTCATOAGGC^ 

CYOQAACCTrOGCATTW 

GGTCTTGCATGCrTTCTTGCAGTATm 

TAGCAAAAGTATACCTAGAATCTTC^m 

GAAGAAAGTTGGTAAATTCTItnt3ACAAATGGAGATrCCC^ 

AGCTAAGTCACACTCATATGCAAGAATTTACCAAGGCCT 

GCrrTATTCATGATAAGGTCTGCTAGAGCAGAACXXXXTrGGTG 

AAAGGTGTAGCATAAATTGTGACTAGAGroTOAAATCTTTACCTO 

GGCAGAGCTGTITrATCTrrACroTAAACATTAC^^ 

TCCTCTGTTTTGTGACAGATACTTCTATW 

TTTCTCTATTATATAAG<X7rGGTGTTCCCTTTC 

CCTTGTGTCTGlX>TCCnTITGT^ 

GTCTnTXX*CAGATTGAGGGAGG<^^ 

GCTCACTGTTGGTGACTAar^^ 

AAGTGTGGGTCCAGGAAGCAGGTTGTACACCCTGTCIX^ 

AAATAAGTrAACCCATCTXTTCCrro 

TGCTGCTGGTGTGTCTCCATCAGAGGGCAGAGCCAG^ 

TGTCCvmX3rTTCTT^ 

AAT(nXK3GAATGACTTTTCTAAGAAATGCCACGTO 

TAAACAAGATATATGGTGCG<^GTITATAATCATAATAAGCTTTGAAATAATA 

CATTCTCATTCTGCTTCCACGCra 

AAAGTAGTGATTTATGACCTGCTTTGTTCTGTTACT 

ATGTTGAAACATGGAACTrGGGGTGACCTGTATCTGTO 

TTTGTCTCCAGAATAAATGAGTTTATCAACra 

CGCCOGIXKSAGTCC^CCATICT^ 

ATGTGATIXDTTCCAACTCCAAATATTTCAAAI^ 

ACTAAGTCTATTGATTrGGAAGATCTCA 

TGAGGCTM(KrraAAAAATATOACAMTOAAATQTCAM 

AGTGTTAGGGGGTTTTAAAQAAACAAATACCTACTCTAMT^ 

CrATTCTAGTTTTGAATAATGTTa^ 

TTATCTraACGAATCTCAG 

aggtctcattcaaccacatctttatca^ 

attctixxagagcactgoaattg^ 

tgrixxh'aaacagagagttacaagacc 

tgacagaatgtcagagtggagctcccacixxkkk^ 

ttggaacatcacacaa^gaaattcx:aaattatg 

ACTTTTGACTCrrGAACATGAG^ 
GTGAAGGAAAGCTCGCCAACTCCTrm 

GACAGTATGCCAGTXXXAAGATCTGTCAATATGACTGAAGACACATTGGGATO 

AAGGCAGGAGAAGGCAAGCTACAGTGMGCCCAGTTCACTATAAAGCT^ 

AG ATAAG AAGGGTTTCTAATTTTTAAA ill 11 1 1 1 ATTAGATATTTTCTTCATTTACATTTC 

AAATGCTATCCCAAAAGTOXCT^^ 

CCATTTCOTGOKXXntXJTTACTTGT 

ACTATCTGGTCTATTGCAAGAATGGCTG<XX3AGCTCA^ 

AGCCAAACAGTGGAAGGAACTIXKjGGACTCTTAT^ 



GCCCXXKIATGG^ 

ACCCTitXKKKntrr^ 

CKXTOTOCACTCATATGTAACAOATATO 

GATtK3GGGTTAOG(KmKKKJATCKK3aOW 

TOTlXnTACTGAGCTGCCTAGlCI^^ 

GTAAAGACTIGAAGITCTtfc^^ 

TAGGAAGAGAAGAGGAG<KXK3AAGACTTGGG^ 

ATQAATAAGAAAAAAAAAATAAAAATAATTAAAAAAAAAAAAQAA1X3GCTO 
TACTCTAAAACGATIGGATCCCCOCC^ 
GATACTAACAATAGATGTAGGAAACTATTGTTAAC^ 
QAACCrnTTTATTOAAGCITrcACAG^ 

TAGCTTAATGATGTTCAGTGAATTAAAAATMCAATGAAGAAT^ 

AAGOATTCTIXXnAlATATITAAAAATCrAA 

TTTCTATATATGTTTACICTATOOT 

CAACATAGTATGTAACTTATAAGAAAGIAAAACATT^ 

CrTAOOCTCAGAAACAClXKKmTGAATAATO 

AGAATGTAATAACCTACTAAGGCAAACATAOAAGTTOAAATTA^ 

GAAGTAATAAGGCAAATAATGAATATTTG<nTrAAA 

GGGTGTGATTCTt^AGACITGACTC 

TTGAAACAATGTGTTGAAAATAATAAACAmCOTACT^ 

TTGCTTTITATrGCTGTGATAAGCACC^TGAOCAA^ 



(SEQIDNO:18) 
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TAAGCCTTTTGCTCCCGTTGGAAGCAAAGAACGTTCCTTCAATCAGGTGAAGGCTCT 

CCTCAGAAGATTTCATGTCTCAGCTTATGTTACAAGAGGATTCAAAAGCAAGACAGA 

AGAGCTCAGGTATAGCCAACTCITITGTTAAATACAGTATGAGGCTTAAGTGTACGG 

CAACTCATGTGGTATTCATTTGCGGCTCTCTTCTCTTATAACTAACTCTTAAGGTGCA 

TATAGTCTCTTCTGTTTCCCAGCTACCTTGCACCATCTITGTTTATCTAATAAT^ 

AGCTCATCTGCTTTTTAATCATCACGCAGAGAGTATTCAAAAATATTCAGTGATGTA 

ACAGTGACAGTGTAGGCATAGAAGTAATCATTAGTAAATCTTAATATGGGTTAAACT 

CATTCATAACAGCTCCAGGTTGG (SEQ ID NO: 19) 



TTTGGAACCAACCCAGATGCCCCTCAACAGAGAAATGGGCCAGAAAATGTGGTCCA 

TTTATCCAATGGAATACTACTCAACTTATTAAAAACAACGACTTTCATAAAATTTTTA 

GGCAAATGNATGGTCTGNAGGATCTTGAGTGAGGTAACCCAATCACAAAAGAACAC 

TCATGGTATGCACTCACTGATAAGTGGCTATTTGTCTATGGAGTGATTTAAAAGGGA 

AGAAGACACATAGCTTTTTGTGTGTATAATATTAAGATGGAAATTTGCCAGTGCTGT 

TTGGCTTATGAGTGAATCTTGTTTCAGTGGATTACCGGAAGAAAATAATAAGTGAAC 

TGTAGGAAGAAGTAGTTAATCAAGGTGACAAAGTATCCTGACACATTGGGAAAAGA 

CCACAGTCCAGGAAACTGAGTCTTAAGGATTCATATTAACTCCAGTTCCCCATGTGC 

AGCTCTGAGACTTTGGCAGATCAGACACTrAACTTCACCAGCTTCCTACACAGAGCA 

GTTACTATCCTTGCCTTCACACATGGAGTGTGCCATTAAGTGCCTGAACATGAGTCT 

GACITGTTAATAATCTTTAAAATCCAATTGTGTGTAAAGTATGTGACCAAAGAGCAT 

GGTCATGCTATTAACCTTTGATGTTCTATGGACTCTTAATTTTATGGTAGAAATGTCA 

ACAAGCTTGTGGAGGCTGGAAGATACAAGGCTTAAGAGGATGGCCITrCAGTTTTG 

AAAGTAATTCAGTATGTGTTCTGGCATCCCTTTTCCTAAAGCAATTTAACCCCCCAA 

GTAGGCATAATTTTAATGCTTACTTCATCAGAATATATCTAATTGACTOTCTAAAAA 

GACTTTGGTATGCATAGGATCTAAATGTAAATGTGATTTACTGACATAATAAATAGG 

AGAAACTGAGCTAGAATAGGTATAAAATATGTGCTGGCITTCTAATAGGTCTTATAG 

GTTATATAAGAGGTGGGAAAGGAATATTTGAAACATCTAGAAGTAAAATGATCCTG 

AGTAGCGATCCTGGGAAAATACGTACTCTAACACACTGCAATCATCTCrCTGTGGTT 

TGCTGGAGCTGAGGTCTGGAAGGCTCGACCITGGTTAGAAATAACCTACCGAATAC 

AGAGCTATGACGTTAGTCTGGAAGGAGCTTTGGAAGAATGACAAGCTGTAGCTGCC 

CAGAACATACTAGATGCCATATTTCCAAGGCAAGTGTCCACATGCGGACATCTTAAG 

AATATGGTTGTCTCTGCAGTGCTAAGGACCTTGTTCGTGCCACACAGGTCTCCAGGG 

TTAGTGCTAACTCTGACTGCTTGACTCTTTAATTCTCCCTTGATCATTAATGACTAGA 

AATCACTTGGTGATTAGCAACTGGATATGKjAATATTACTrAATTTGTACCCAAGCCA 

GGCCACCTCAGCTTTGGCAGCTCCATTCAlTCrGTGGAGCCCAGTCACGTGCK3TTTG 

AATCAACTGTACTGTTTCTAOTACAAGACGCATTACCTGAGATGAGTCATTTTTC^ 

CACAAGTCTTTTTAGAAGAGTCAATTAGACATATTCTGATGAAGTAAGCATATAAAG 



TGAGAGCAGCATGAATGTGTTCCATGTATGCTCATGGATGCTATTATAATGTGGAAA 

TAAACTGACTTTAAAAAAAAAAGCTTATGATACTTGTCACAGAGTAAATCTTCCATA 

AATATCATCTGCATTTATAAATTATTTTCATAATCCATCAATTAAAAACCITrAGAAA 

TTTTGTTAACACAAAGATCCCTAGGCCCCTGCCCTAGGATGGTCTGTATGGTGGGCC 

TGAGAGATGGAGCTTAAGAACTTACTTGCTCCAGGAGCACATCTTCAGAACATCTGC 

CTCAAAACATTTATCCCAAATGCTCATCAAAGGCTCACTCACATGTGCTrCAACCAC 

AGGGATTAAACAGTCATTTTAGTCACATTTCTCAAACGGTCK3AAGCCTGCTAGAGGA 

ACAGGATGTATCAGGATAACATCCAACCTTACAAAAGGATGTCATAACCCTCACCA 

CAACAAACAACAACGACAACAAACCCATAAAAATTATCACGGCAAATGAACTAAGC 

CATATGCAGAAAAAGTATTATATGTTCTCATTGTGGGGTGTTTTTCCTTAATAGTCAA 

ATATGCAGAATATAGACAAAGATGGTTTATGCAAGTGGGGATGGCGAAGGATACTT 

GTAGATTAGAGGACACAAAGCAACAACTACAGAGTGAAGTAATCCAGAGACTTAAT 

GTATAATATGAGGACTGTATTTAATAATTCTATTTAAGATACACAGCAAACGAGTGT 

ATCTTACTAACACACACACTTACATAGAGAGAATAAAGTGATAGATACGTTTGTTTT 

ATCTTCATGTAGCTGATAATTTCATATTGTACACCTCAAACATAGATAACCAACAAA 

GAGGAAGAGGATAGGTGCCTCTCCCAGGGCGGAAGAGTACATTCGAAAGTCAGACA 

CCATTGTGTAGATGTACCACATGGAGGAGCTAGAGAAAGTAGCCAAGGAGCTAAAG 

GGATCTGCAACCCTATAGGTGGAACAACATTATGAGCTAACCAGTACCCCGGAGCT 

CTTGACTCTAGCTGCATATATATCAAAAGATGGCCTAATCGGCCATCACTGGAAAGA 

GAGGCCATTGGACTTGCAAACTTTATATGCCCCAGTACAGGGGAATACCAGGGCCA 

AAAAGGGGGAGTGGGTGGGCAGGGGAGTGGGGGTGGGTGGATATGGGGGACTTTT 

GGTATAGCATTGGAAATGTAAATGAGTTAAATACCTAATAAAAAATGGAAAAAAAA 

AAAAAAAAAAAAAAAAAGGAAGGTCAGACACCTCACTTCACTGCTATCTCAACTTG 

CAAACAGAAGGGGAGTCACAAACCCAGGACAAACCACAGTGATTGAAGCGTCTTTG 

AATGTTATTGCTGTTGTrGTTACCACCATCATTAGCATATATTCATTGTGAAAACTTA 

CGGGGTCTATGACATGTTTTTTTATTCAAGTATATCACATGCTGTCAGCATATTTGGC 

ACCACTACCAGCCCCAGCCCCCTTTGCCCCGCCCCCAACACACACACACACACACAC 

ACACACACACACACACACACACACACACACACACACACCTTTACCTTCTCCTGGGCA 

TCATCTGCTCACTCACCCACCCAAGCTTAATCCTTTTCCTrCCCTGCAATAGTACCTC 

TCCTATTTTTATGTCTAGGTTCCCCCTCCCCCTGTTAGGAGATGGGAGAGGTCACGA 

AAGAAAGGAATTTGTAGCCCTTGAGCCAGCCCGGGCCACAGAGCCTGCCACCAGAC 

AGGAAAAGCCCAGGGCTTACCAGCACAGGAGGAGCAAACTCGCAGGCGAGCCTGG 

GTTGGCGCTGGTGGTCCCGGGTCGATGGCCCGCCCATTCCCAGAAGCCGAGGCTATA 

GCTGCGTCACCTGCCCCGCCCTCCTCCCGAGTGAAGACCCCTAGAGGCTGAGCAGAC 

CCCAAAGGCGGTGCAATTCCATTGGCCCAAGGCAGAGGTGAGCGGCTGCTAATCCC 

CTCGGGAAGTGAAGGGACCCAGAGAGTCTGGTAGATGTGGGAGCTGGGGTTCAGGG 

CGAGACAGAGGGTGGGATGGGCAGAAGGGTCCAGGAAAGGAAAGTACTGGAGGGG 

AGTTGGGACAAAAGCAGCGACCAAGGGAACATCGCTTCAGTGACTGAAGCCAGGCA 

AAAGGAGCGGGAAGGATTATATGTAGCCTGGGACGCTTTCATAAACACTGATGACG 

TGTTTGTGCAAAGCAAGCAATTTGAGGAGAAACGCCTGGGACGTCGGAAAGAAGGA 



GTGATCGATTAGTACITGTAAGTTTAGGTGAGTTTGAGAACTAACTAACCTATACTA 

TTGAGGGAGAAGGAAGAGCATTCCAGCAGCAGCAGCAGCAGCAGCAATCAGATAA 

AGGAAAGCTTTGGTTAGTTTGGAAATGTATGATACCATTAAAATAACAGAAGCGCCT 

CCAGTTCTCTGAAGAGTCAGTCCCCCAGCTAGTGAAGACTAAGCCTACTAAGCCTTT 

TGCTCCCGTTGGAAGCAAAGAACGTTCCTTCAATCAGGTGAAGGCTCTCCTCAGAAG 

ATTTCCTGTCTCTGCTTATGTTACAAGAGGATTCAAAAGCAAGACAGAAGAGCTCAG 

GTATTGCCAACTCTTTTGTTAAATACAGTTTGAGGCTTAAGTGTACGGGAACTCATG 

TGGTATTCATTTACGGCTCTCTTCTCTTATAACTAACTCTTAAGGTGCATATAGTCTC 

TTCTGTTTCCCAGCTACCTTGTACCATCTTTGTTTATCTAATAATAGCAAGCTCATCT 

GCTITTTAATCATCACGCAGAGAGTATTCAAAAATATTCAGTGATGTAACAGTGACA 

GTGTAGGCATAGAAGTAATCATTAGTAAATCTTAATTTGGGTTAAACTCATTCATAA 

CAGCTCCAGGTTGGGAGGGATCACTGAGCCTTCGCCACGTGCGGGTTAAAGATATTT 

TCTAACAAGAGAAGCAGAATTCTTCCTTGGCCATGCTCCCCATCACTGTGTCAGTAA 

GCAGAGGGGTGTTTCCAAGCAGAGAAAGAGCAGACAGTGTTATGCCTGCAAAGTCA 

GAGACTCAGCCCTCCCAGCTGGTCAGTTTACTGTCCTCCCGGTCATTAGTTGGCTCTG 

AAAAGGCCCATGTGTCCTTATTGGCAAGGACTTGCAGACATGCTAGAAAGAAATTT 

GACCIIUIUUU'CTAGTGGGTTATTACAGCTGTAAAAGTATTTTGGAAGGTTAAGCCA 

AATAAATAAAACACATATTAAATAATACAATGTTACAAAAATTGATCATATAAAGA 

AGTACATTCATAAATGCAATGTGAAAAATATATATAATTTTTATCTATTTACTGGTGC 

AAAGTTTTCTAAATTGCACATGTACTATTTTTATATTTATAAAAATAIT^ 

TATATAAAAGTGTAAAAGGCTCTTGGTCAAACAAGAGAGTTAAATTTACAAACTTTA 

ATTGTCCCGATAACATTATTATGATCTCTAATGACAGGGATCCTGCTTTTCATTGGGA 

AATGAGAAGCTATGAAGATATGTTTACAATAATAAGCCCATTTAGTGATAAAGTCCA 

ATGGGAAGCTAGCACACACTGGTTTATAAAGAGAACAGTTTCCTGAGTCTATGCAA 

GTTTACACTCTAGGGAATAAGAGTTCCTCTTTCTCCAGATTTCACTAGCATTTGTTGT 

CATCATTTATCTTCTTGATGATGAGCATTATAAGTGGAATAAGATAGGATCTCAAAG 

GAATGTCAATTTGGATGCCCTGAACAATCTTTCAGGTCTTTCTTTCAGTTCACTAGTC 

TATTCATTTATTGGATAATTGGGGGGATGGTGGTAATTTTTTTGCAGTTCITATGGAA 

TTCCAAAAAACAAAAAACAAACCAACCAACCAAAAACCTCTGAAACTAGAACTACC 

AATCCATTACTGGGTATGTAACAAAGAGAAATCTGCACAGAATTTATTGCTACATTG 

TTCATTATTCACGACAGCCAAGAATGTGGAACCAACTTACGTAGCCGTCAAAATATG 

AACGGATAAAGAAAATGTGGAAATGTGTACAACAGAGTCCCATGTGGCCATAAAAG 

AGTGAAATCATGACATATGCAGGAAATGGATGCAACTGGAAATCAATTGGGCTAAT 

CAAAACAAGACAGACTCAAAAAGGAAACACCGTGTAGCTTCTCTGACAAACAGAAG 

CTAGATTTACACTTGTACGTGCGCATGTGTGTTTAGAATTTTATTTAGTTATACACT 

TTCTAATCTGTGAGTGTGTATAAAGGCATGCATGTAAAGCAAAAACAAGCTAGCTG 

GGGTGGGTAGGAGAGAAAGCAATGAGAGGAGTTAATAAGAACGAAGCATAGTAAC 

ATAGGTGCCAGGATGAAATGCATTAATTTGTATGCTAACTAAACCACAGACAGGAG 

GCACACGTTCAAACCAGGGTGAAATCCCAGCACAGAGAAGGGGAAGTAGACACAA 

AGTTTCGCCACTAACCAAGAAGCCATTTGCAGTTGCTGCCTGCTGGGAAGGGGCGTT 



ccagttttctccagtctgacactgtgtataacaaccagttgacaatacaaagttggc 
atgatggatggtitttgtgctatttttcattttttttct^ 
ggttgttgtggtggtggctgtggttttcatttgttrcttttgagagagagaaggaac 
atgaaattgggtgggtaggaagctggaaacgatctggaagaagttggggaaagag 
aaaaattgtatggagcatatttaaacaaacaaacaaacaaacaaaaggttcatttt 
gccacaaaaaggtgtgaattaaattaaccagttacgActcttaaagaaaatattccc 
aattattcccagagttgctatgtatgctgtgcctaggactttgcttgaactggcccta 
taactctggtgtggtgtcttttcaggatgcagaagagaggcagggaagtcagctgct 
tgctgatctccctcactgccatctgcctggtggtcacccctgggagcagggtctgtc 
ctcgccgatgtgcctgctatgtgcccacagaggtgcactgtacatttcgggacctga 
cctccatcccagacgggcatcccagccaatgtggaacgagtcaatttagggtgtgtg 
gaccttgcctgatctccttctcagagagggaccactgattttcctggtactttgcccc 
ccaaacacctgtgattacttttaatagttitcttctaaaatgggttcatacaaacctt 
atattgtggagacaatgaacattttatcccaatagtcttttactagaacttgaagcc 
cctcttagttgtttgggagcctcataattatggggcagctttattctgaatgaatttt 
r :i aaatgaaaaagatacagtttctgttaacaatcattatgataccaaggaagaggaat 
u 4 tgtcattgaatattttaaaaaagcatttcttttgcaatitataaatacccatracaaa 
atggcttacttaaaatacttgccttactaaatctgacaaattatggtgatattttgaa 
p ggtttatgaaaatttgtttatgtgtataaatgcacaagaaatgggatatgccatcac 
£?i ctatgtgccattagtgagcatgtacagtatgccaaacactattgttcacgtttggag 
id gaagtaatgggggtgggggagcaacaagggttataaccgtatacccagtgccttgg 
p aagcgattgcaaacagtaaagactgacattgtgttctccctatgagggaggggcctt 
■ gggctgagcactttgcaatgagcatttgctcattgtgctggcaggttttatgataac 
h ttgacccaagctagagtcactggagaggaaggaacttcaactgagaacatgcctga 
^ agaagatcagattataggcaggcctgtggggcattttctraattagtgattcatggg 
2 gcagggcccagtccattgttcgtggtaccatttctcaggcactattaaaaaaaaaaa 
jt! aacaggctgagcaagtgtcaaggagcaagtcagtgagcagcagccctaatgatctc 
h tgcatcagctcctgcctccaggttcctaccctatttgagttcctgtcctagctcccta 
r " cagtgatgaacaatgatgtggaagtataagccaaataaatcctttcttccccaactt 
gctgttggtcatgatgtttcatcacagtgataatagtcctcatgaagatgctggtgt 
ttataacacctttggactaaattctgttatctatagctgaggaaaatggagcataga 
aagtctccagactacaccagagtgtaatctgggcctgagcttagaatcacacccac 
gtgcactccactgccggggcttcitaaccggaacacagttgtaaaagggaattttct 
gtttgtttccattitgacatgtggacttraattgacgattcatctgaagctgaaaatg 
a'lluuuluu'ccaggtataacagcctcactagattgacagaaaatgactntctggcct 
gagcagactggagttactcatgctgcacagcaatggcattcacagagtgagtgaca 
agaccttctcgggcttgcagtccttgcaggtgagataggtagagggtgatggaggc 
tgagaagagaggtgcaactgtgggttatacccaaaagctgctgattcccgtgggag 
acattctataagcattctataaactagaggcagatatcaaggaaggatttcaattgt 
aatgcaattttatgagaaaatttgaatattaagaaaatgctggggaaaatgcttac 
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ag^a^aSg^aaaggaaaaaagagatgggga 

GG^AICCC^ATV^TAAGGGCAXjT^ 

SS^=-=S^G^= 

i^^B^=^r A SAi^= 

TrrArrTATGTTTATTCTCTrAGCCAGGriTITCTCTGATGGAGAAACArrGGGTTTG 

A^ACT«S£^aW^ 
TATITATGAGGGATGTK3GTATrCCAGAAAATCCTCT^ 

2 



CATCACTAATGCTTTTCACCATACACAAATGTAGGCAGAAGAGTGTAATCCACTTAG 

TGAGGCAATCTTGGAGAGGGAAAGGAAGCGGATGCGGGGCAGAGGCACACAGGAG 

GACAGTGAGAGGGAAATGAACAAGAAAAAATGTGGACACATGCACAAAAATTCCA 

TAGTCCACTACATTACTTTGTATrCTAATATTAAGAAAATAATAAACCCATTTCTGTG 

CACTTATCACCCAGGCTCAACAGTTATCTTGGCCACAGATCCTGTCTCACTGCATCCT 

GTCCACCTGAGTCCACTTACTCGTrcrGAATCCAATCCAGGGCATGATGCTTACTCCT 

ACACAGAACTAAAGATTAAAGAGAGTTTAAAAGTAACCATGACATCTCTCTGTTCCT 

TTAGCG ATAAGTTCTT AATATTTATGG CTGCTTGTGT ATGTTCTA ATTTCTCTA ATATT 

GTCACATTTAGTTGGCAACTACTTTGTTTGAATTGAGTTGGAGTTAAGGTCCCATAG 

GATTAATCTCAACATATTTCTATATTTATAAACTriTCT 

GAGAAAACAAATATGCCCATATCTTTCTTTACAGGTCTTAAAAATGAGCTATAACAA 
AGTCCAAATAATTGAGAAGGATACTTTGTATGGACTCAGGAGCTTGACCCGGTTGCA 
CCTGGATCACAACAACATTGAGTTTATCAACCCCGAGGCGTTTTACGGACTCACCTT 
GCTCCGCTTGGTACATCTAGAAGGAAACCGGCTGACAAAGCTCCATCCAGACACAT 
TTGTCTCTTTGAGCTATCTCCAGATATTTAAAACCTCCTTCATTAAGNACCTGTACTT 
5 GTATGATAACTTCATTGACCTCCCTCCCAAAAGAAATGGTCTCCTCTATGCCAAACC 
^ TAGAAAGCCTTTACTTGCATGGAAACCCATGGACCTGTGACTGCCATTTAAAGTGGT 
3 TGTCCGAGTGGATGCAGGGAAACCCAGGTAACTATCTTGTITGTITGTTTCTTTTm 
U ATARKACGTATTTTCCTCAATTTCATTTAGAATGATATCCCAAAAGTCCCCCATAACC 
Cm TCCCCCCCACTTCCCTACCTACCCATTCCCATTTTTTGGCCCTGGCATTCCCCTGTACT 
W GGGGCATATAAAGTTTGCGTGTCCAATGGACCTCTCTTTCCAGTGATGGCCAACTAG 
P GCCATCTrTTGATACATATGCAGCTAGAGTCAAGAGCTCTGGGGTACTGGTTAGTTC 
J ATAATGTTGTTGCACCTACAGGGTTGAA (SEQ ID NO:20) 
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Figure 51 

MPKRAHWGALSVVLILLWGHPRVALACPHPCACYVPSEVIiCTFRSLASVPAGIARHVE 
RINLGFNSIQAl^ETSFAGLTKLELLMfflGNElPSIPDGALPJDLSSLQVFKI^SYNKLRVITG 
QTLQGLSNLMRLHTOFINKIEFTHPQAFNGLTSL^ 

I^TIRHLYLAENMWTLPASMLRNMPLLENLYLQGNPWTCDCEMRWFI^WDAKSRGIL 

KCKKDKAYEGGQLCAMCFSPKKLYKFffifflKIJCDMTCLKPSIESPLRQNRSRSIEEEQEQ 

EEDGGSQLILEKFQLPQWSISLNMTDEHGNMVNLVCDIKKPMDVYKIHLNQTDPPDIDIN 

ATVALDFECPMTRENYEKLWKLIAYYSEVPVKLHRELMLSKDPRVSYQYRQDADEEAL 

YYTGVRAQILAEPEWVMQPSIDIQLNRRQSTAKKVLLSYYTQYSQTISTKDTRQARGRS 

WVMffiPSGAVQRDQTVI^GGPCQI^CNVKASESPSIFWVLPDGSILKAPMDDPDSKFSIL 

SSGWIJRIKSMEPSDSGLYQCIAQVRDEMDRMVYRVLYQSPSTQPAEKDTVTIGKNPGES 

VTLPCNALAIPEAHLSWILPNRRIINDLANTSHVYMLPNGTLS1PKVQVSDSGYYRCVAV 

NQQGADHFTVGITVTKKGSGLPSKRGRRPGAKALSRVREDIVEDEGGSGMGDEENTSR 

PJXHPKDQEVFLKTKDDAINGDKKAJOCG^ 

INMANKQD^ERWADILAKWGKNIJKGTEWPLIKTTSPPSI^IJEVTPPFPAVSPPSASP 

VQTVTSAEESSADWLLGEEEFT^LGTISSASMGLEHNHNGVILVEPEVTSTPLEEVVDDL 

SEKTEEITSTEGDLKGTAAPTLISEPYEPSPTIJTI1.DTVYEKPTHEETATEGWSAADVGSS 

PEPTSSEYEPPLDAVSLAESEPMQYFDPDLETKSQPDEDKMKEDTFAHLTPTPTIWVNDS 

STSQLFEDSTIGEPGVPGQSHLQGLTDNIHLVKSSLSTQDTLLIKKGMKEMSQTLQGGNM 

LJEGDPTHSRSSESEGQESKSITLPDSTLGIMSSMSPVKKPAETTVGTIJJDKDT^ 

QKVAPSSTMSTHPSRRRPNGRRRLRPMCFRHRFIKQTPPTTFAPSETFSTQPTQAPDmSS 

QVESSLVPTAWVDNTVNTPKQLEMEKNAEPTSKGTPRRKJ1GKRPNKIIRYTPSTVSSRA 

SGSKPSPSPENKHRNTvHTPSSF^lIXPRTVSLKTEGPY^^ 

VTYKPTSDGKEIKDDVATNVDKHKSDILVTGESITNA1PTSRSLVSTMGEFKEESSPVGFP 
GTPTWNPSRTAQPGRLQTDIPVTTSGENLTDPPLLKELEDVDFTSEFLSSLTVSTPFHQEE 
AGSSTTI^SIKVEVASSQAETTTIJDQDHI^TT^^ 

TimSLGQTTTTKPALPSPRISQASRDSKENVFI^YVGNPFTEATPVNNEGTQHM 

I^lPSSDRDAFl^TKLELEKQVFGSRSLPRGPDSQRQDGRVHASHQLTRVPAKPILPTA 

TVRIJEMSTQSASRYFVTSQSPPvHWTNKPEITTYPSGALPENKQFTTPRLSSTTIPLPLHM 

SKPSIPSKFTDRRTDQFNGYSKVFGNNNIPEARNPVGKPPSPRIPHYSNGRLPFFTNKTLSF 

PQLGVTRRPQIPTSPAPVMRERKVIPGSYNRIHSHSTFHLI) 

LQNIPMVSSTQSSISFTrSSVQSSGSFHQSSSKFFAGGPPASKFWSLGEKPQILTKSPQTVSV 
TAETDTVFPCEATGKPKPFVTWTKVSTGALMTPNTRIQRFEVLKNGTLVIRKVQVQDRG 
QYMCTASNIJiGIJ)RMVVIXSVTVQQPQ 

SWIFPDRRVWQTVSPVESRrTLFfflNRTLSIKEASFSDRGVYKCVASNAAGADSLAIRLHV 
AALPPVfflQEKI^NISLPPGI^IHfflCTAKAAPLPSVRWVLGDGTQIRPSQFLHGNLFVFP 
NGTLYIRNL APKDS GR YEC V A ANLVGS ARRTVQLNVQR A A AN ARITGTSPRRTD VR YG 
GTLKIJDCSASGDPWPRILWRIJSKRMIDALFSFDSRIKWANGTLVyKSVTDK 



• # 



DAGDYLCVARNKVGDDYVVLKVDVVNQCPAKIEHKEENDHKVFYGGDLKVDCVATGL 

PNPEISWSLPDGSLVNSFMQSDDSGGRTKRYYVI^GTLYFNEVGMREEGDYTCFAEN 

QVGKDEMRVRVKVWAPATIRNKTYIAVQVPYGDVVTVACEAKGEPMPKVTWLSPTN 

KVIPTSSEKYQIYQDGT1XIQKAQRSDSGNYTCLVRNSAGEDRKTVW]HVN^ 

NPNPriTVREIAAGGSPJa.roCKAEGIPTPRVLWAFPEGVVLPAPYYGNRITVHGNGSLDI 

RSLRKSDSVQLVCMARNEGGEARLIVQLTVLEPMEKPIFHDPISEKITAMAGHTISLNCS 

AAGTPTPSLVWVLPNGTDLQSGQQLQRFYHKADGMLfflSGLSSVDAGAYRCVARNAA 

GHTERLVSLKVGLKPEANKQYHNLVSDNGETLKLPCTPPGAGQGRFSWTIJ'NGMHLEG 

PQTLGRVSLU^NGTLTVREASVFDRGTWCRMETEYGPSVTSIPVPVIAYPPRITSEPTPVI 

YTRPGNTVKLNCMAMGIPKADITWELPDKSHLKAGVQARLYGNRFLHPQGSLTIQHAT 

QRDAGFYKCMAKNILGSDSKTTYIHVF (SEQ ID NO: 21) 
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Figure 52 

ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT 
TTGGGGCCAT CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT 
ACGTCCCCAG CGAGGTCCAC TGCACGTTCC GATCCCTGGC TTCCGTGCCC 
GCTGGCATTG CTAGACACGT GGAAAGAATC AATTTGGGGT TTAATAGCAT 
ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG TTGGAGCTAC 
TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 
GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT 
GATCACAGGA CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA 
TTGACCACAA CAAGATCGAG TTTATCCACC CTCAAGCTTT CAACGGCTTA 
ACGTCTCTGA GGCTACTCCA TTTGGAAGGA AATCTCCTCC ACCAGCTGCA 
CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTATTTC AGACTCTCCA 
CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 
□ AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA 
& TCCGTGGACC TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA 
$ AATCCAGAGG AATTCTGAAG TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT 
^ CAGTTGTGTG CAATGTGCTT CAGTCCAAAG AAGTTGTACA AACATGAGAT 
£ ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA GAGTCCCCTC 
}! J TGAGAC AGA A C AGGAGC AGG AGTATTGAGG AGGAGCAAGA AC AGGAAGAG 
£ GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG 
*" CATCTCTTTG AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT 
I, GTGACATCAA GAAACCAATG GATGTGTACA AGATTCACTT GAACCAAACG 
U GATCCTCCAG ATATTGACAT AAATGCAACA GTTGCCTTGG ACTTTGAGTG 
(3 TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG ATAGCATACT 
tf i ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 
P CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA 
H CTACACAGGT GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC 
AGCCATCCAT AGATATCCAG CTGAACCGAC GTCAGAGTAC GGCCAAGAAG 
GTGCTACTTT CCTACTACAC CCAGTATTCT CAAACAATAT CCACCAAAGA 
TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG CCTAGTGGAG 
CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 
TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA 
TGGCTCCATC CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA 
TTCTCAGCAG TGGCTGGCTG AGGATCAAGT CCATGGAGCC ATCTGACTCA 
GGCTTGTACC AGTGCATTGC TCAAGTGAGG GATGAAATGG ACCGCATGGT 
ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC GAGAAAGACA 
CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 
GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG 
GATAATTAAT GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG 
GAACTCTTTC CATCCCAAAG GTCCAAGTCA GTGATAGTGG 



TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC CATTTTACGG 
TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 
AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA 
GGATGAAGGG GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC 
TTCTGCATCC AAAGGACCAA GAGGTGTTCC TCAAAACAAA GGATGATGCC 
ATCAATGGAG ACAAGAAAGC CAAGAAAGGG AGAAGAAAGC TGAAACTCTG 
GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA GGTCGCAGAG 
TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 
GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA 
GGGCACAGAA GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA 
GCCTAGAAGT CACACCACCT TTTCCTGCTG TTTCTCCCCC CTCAGCATCT 
CCTGTGCAGA CAGTAACCAG TGCTGAAGAA TCCTCAGCAG ATGTACCTCT 
ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA GCCAGCATGG 
GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 
f i AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA 

5 GATAACTTCC ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA 
TATCTGAGCC TTATGAACCA TCTCCTACTC TGCACACATT AGACACAGTC 
TATGAAAAGC CCACCCATGA AGAGACGGCA ACAGAGGGTT GGTCTGCAGC 

H AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT GAGCCTCCAT 
8* TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 
W GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC 

6 CTTTGCACAC CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCAGTA 
J CATCACAGTT ATTTGAGGAT TCTACTATAG GGGAACCAGG TGTCCCAGGC 
JT CAATCACATC TACAAGGACT GACAGACAAC ATCCACCTTG TGAAAAGTAG 
S TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG AAAGAGATGT 
p CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 
f j AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA 
U CTCCACACTG GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG 

AAACCACAGT TGGTACCCTC CTAGACAAAG ACACCACAAC AGTAACAACA 
ACACCAAGGC AAAAAGTTGC TCCGTCATCC ACCATGAGCA CTCACCCTTC 
TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC AAATTCCGCC 
ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 
TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA 
GAGTTCTCTG GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA 
AACAGTTGGA AATGGAGAAG AATGCAGAAC CCACATCCAA GGGAACACCA 
CGGAGAAAAC ACGGGAAGAG GCCAAACAAA CATCGATATA CCCCTTGTAC 
AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT CCAGAAAATA 
AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 
ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC 
AACCACCAGA AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC 
CAGTCACATA 




TAAACCCACA TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG 
TTGACAAACA TAAAAGTGAC ATTTTAGTCA CTGGTGAATC AATTACTAAT 
GCCATACCAA CTTCTCGCTC CTTGGTCTCC ACTATGGGAG AATTTAAGGA 
AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG AATCCCTCAA 
GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATAGCTGT TACCACTTCT 
GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA 
- TTTCACTTCC GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC 
AGGAAGAAGC TGGTTCTTCC ACAACTCTCT CAAGCATAAA AGTGGAGGTG 
GCTTCAAGTC AGGCAGAAAC CACCACCCTT GATCAAGATC ATCTTGAAAC 
CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT CACACCCCTA 
CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 
ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG 
AATATCTCAA GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG 
TGGGGAATCC AGAAACAGAA GCAACCCCAG TCAACAATGA AGGAACACAG 
CATATGTCAG GGCCAAATGA ATTATCAACA CCCTCTTCCG ACCGGGATGC 
Jn ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA TTTGGTAGTA 
J GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 
v3 GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC 
H AACAGTGAGG CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG 
CP TAACTTCCCA GTCACCTCGT CACTGGACCA ACAAACCGGA AATAACTACA 
W TATCCTTCTG GGGCTTTGCC AGAGAACAAA CAGTTTACAA CTCCAAGATT 
O ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA CCCAGCATTC 
» CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 
GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC 
P TCCCAGTCCA AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA 
% CCAACAAGAC TCTTTCTTTT CCACAGTTGG GAGTCACCCG GAGACCCCAG 
& ATACCCACTT CTCCTGCCCC AGTAATGAGA GAGAGAAAAG TTATTCCAGG 
U TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG GACTTTGGCC 
CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCACGGG ATCACCCTCA 
ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC 
CTTTATAACA TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT 
CAAAGTTCTT TGCAGGAGGA CCTCCTGCAT CCAAATTCTG GTCTCTTGGG 
GAAAAGCCCC AAATCCTCAC CAAGTCCCCA CAGACTGTGT CCGTCACCGC 
TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA CCAAAGCCTT 
TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 
AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGjGAA 
GGTTCAAGTA CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC 
ACGGCCTGGA CAGGATGGTG GTCTTGCTTT CGGTCACCGT GCAGCAACCT 
CAAATCCTAG CCTCCCACTA CCAGGACGTC ACTGTCTACC TGGGAGACAC 
CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC CAAATTTCCT 
GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC 



CGTGGAGAGC CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG 
AGGCGTCCTT CTCAGACAGA GGCGTCTATA AGTGCGTGGC CAGCAATGCA 
GCCGGGGCGG ACAGCCTGGC CATCCGCCTG CACGTGGCGG CACTGCCCCC 
CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC CCGGGGCTCA 
GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC CAGCGTGCGC 
TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG 
GAACTTGTTT GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC 
CCAAGGACAG CGGGCGCTAT GAGTGCGTGG CCGCCAACCT GGTAGGCTCC 
GCGCGCAGGA CGGTGCAGCT GAACGTGCAG CGTGCAGCAG CCAACGCGCG 
CATCACGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC GGAGGAACCC 
TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 
AGGCTGCCGT CCAAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG 
AATCAAGGTG TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGACGGACA 
AAGATGCCGG AGATTACCTG TGCGTAGCTC GAAATAAGGT TGGTGATGAC 
TACGTGGTGC TCAAAGTGGA TGTGGTGATG AAACCGGCCA AGATTGAACA 
S CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC CTGAAAGTGG 
^ ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTGCCA 
,5 GACGGGAGTC TGGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGACG 
£ CACCAAGCGC TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG 
Cfi TGGGGATGAG GGAGGAAGGA GACTACACCT GCTTTGCTGA AAATCAGGTC 
Ul GGGAAGGACG AGATGAGAGT CAGAGTCAAG GTGGTGACAG CGCCCGCCAC 
0 CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT GGAGACGTGG 
TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG 
}■ " TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT 
£ ATACCAAGAT GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG 
$ GCAACTACAC CTGCCTGGTC AGGAACAGCG CGGGAGAGGA TAGGAAGACG 
i-! GTGTGGATTC ACGTCAACGT CCAGCCACCC AAGATCAACG GTAACCCCAA 
t! CCCCATCACC ACCGTGCGGG AGATAGCAGC CGGGGGCAGT CGGAAACTGA 
' " TTGACTGCAA AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 
CCCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT 
CCATGGCAAC GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG 
TCCAGCTGGT ATGCATGGCA CGCAACGAGG GAGGGGAGGC GAGGTTGATC 
GTGCAGCTCA CTGTCCTGGA GCCCATGGAG AAACCCATCT TCCACGACCC 
GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC AGCCTCAACT 
GCTCTGCCGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 
GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC 
TGACGGCATG CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT 
ACCGCTGCGT GGCCCGCAAT GCCGCTGGCC ACACGGAGAG GCTGGTCTCC 
CTGAAGGTGG GACTGAAGCC AGAAGCAAAC AAGCAGTATC ATAACCTGGT 
CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC CCTCCCGGGG 
CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 



GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC 
GGTTCGTGAG GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG 
AGACGGAGTA CGGCCCTTCG GTCACCAGCA TCCCCGTGAT TGTGATCGCC 
TATCCTCCCC GGATCACCAG CGAGCCCACC CCGGTCATCT ACACCCGGCC 
CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT CCCAAAGCTG 
ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 
GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT 
CCAGCATGCC ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA 
ACATTCTCGG CAGTGACTCC AAAACAACTT ACATCCACGT CTTCTGAAAT 
GTGGATTCCA GAATGATTGC TTAGGAACTG ACAACAAAGC GGGGTTTGTA 
AGGGAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT GTCACAGTGC 
ATGGTGGCCT CTGGTGGGTT TCAAGTTGAG GTTGATCTTG ATCTACAATT 
GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT 
GAGACACTTT CTTTTGTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG 
TCTGTGCTCT GACTGCAATT TTTCTTCTTT TGCAAATGCC ACTCGACTGC 
CTTCATAAGC GTCCATAGGA TATCTGAGGA ACATTCATCA AAAATAAGCC 
ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA TCACCTAGTT 
AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGACAAGTCA 
TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG 
GATTTAGAAC CAGAGTGACT GATATATATA TATATATTTT AATTCAGAGT 
TACATACATA CAGCTACCAT TTTATATGAA AAAAGAAAAA CATTTCTTCC 
TGGAACTCAC TTTTTATATA ATGTTTTATA TATATATTTT TTCCTTTCAA 
ATCAGACGAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT ATTAAAATTA 
ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 
ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC 
TGTTATATTA CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT 
AGATTTCCTT GTATGCAAAG TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT 
GAGAGGAGAG GAAGGAGAAA ACTGCATCAT AACTTTACAG AATTGAATCT 
AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT CTGGCTTGTC 
CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 
CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT 
TAAGGTCAAT ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 
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ATGAAGGTAAAAGGCAGAGGAATCACCTGCTTGCTGGTCTCCTTTGCTGTGATCTGCCTGGTCGCCACC 
CCTGGGGGCAAGGCCTGTCCTCGCCGCTGTGCCTGTTATATGCCTACGGAGGTACACTGCACATTTCGG 
TACCTGACTTCCATCCCAGACAGCATCCCGCCCAATGTGGAACGCATCAATTTAGGATACAACAGCTTG 
GTTAGATTGATGGAAACAGATTTTTCTGGCCTGACCAAACTGGAGTTACTCATGCTTCACAGCAATGGC 
ATTCACACAATCCCTGACAAGACCTTCTCAGATTTGCAGGCCTTGCAGGTCTTAAAAATGAGCTATAAT 
AAAGTCCGAAAACTTCAGAAAGATACTTTTTATGGCCTCAGGAGCTTGACACGATTGCACATGGACCAC 
AACAATATTGAGTTTATAAACCCAGAGGTTTTTTATGGGCTCAACTTTCTCCGCCTGGTGCACTTGGAA 
GGAAATCAGCTCACTAAGCTCCACCCAGATACATTTGTCTCTTTGAGCTACCTCCAGATATTTAAAATC 
TCTTTCATTAAGTTCCTATACTTGTCTGATAACTTCCTGACCTCCCTCCCTCAAGAGATGGTCTCCTAT 
ATGCCTGACCTAGACAGCCTTTACCTGCATGGAAACCCATGGACCTGTGATTGCCATTTAAAGTGGTTG 
TCTGACTGGATACAG GAGAAG CCAGATGTAATAAAATGCAAAAAAGATAGAAGTCCCTCTAGTGCTCAG 
CAGTGTCCACTTTGCATGAACCCTAGGACTTCTAAAGGCAAGCCGTTAGCTATGGTCTCAGCTGCAGCT 
TTCCAGTGTGCCAAGCCAACCATTGACTCATCCCTGAAATCAAAGAGCCTGACTATTCTGGAAGACAGT 
AGTTCTGCTTTCATCTCTCCCCAAGGTTTCATGGCACCCTTTGGCTCCCTCACTTTGAATATGACAGAT 
CAGTCTGGAAATGAAGCTAACATGGTCTGCAGTATTCAAAAGCCCTCAAGGACATCACCCATTGCATTC 
ACTGAAGAAAATGACTACATCGTGCTAAATACTTCATTTTCAACATTTTTGGTGTGCAACATAGATTAC 
GGTCACATTCAGCCAGTGTGGCAAATTTTGGCTTTGTACAGTGATTCTCCTCTGATACTAGAAAGGAGC 
CACTTGCTTAGTGAAACACCGCAGCTCTATTACAAATATAAACAGGTGGCTCCTAAGCCTGAAGACATT 
TTTACCAACATAGAGGCAGATCTCAGAGCAGATCCCTCTTGGTTAATGCAAGACCAAATTTCCTTGCAG 
CTGAACAGAACTGCCACCACATTCAGTACATTACAGATCCAGTACTCCAGTGATGCTCAAATCACTTTA 
CCAAGAGCAGAGATGAGGCCAGTGAAACACAAATGGACTATGATTTCAAGGGATAACAATACTAAGCTG 
GAACATACTGTCTTGGTAGGTGGAACCGTTGGCCTGAACTGCCCAGGCCAAGGAGACCCCACCCCACAC 
GTGGATTGGCTTCTAGCTGATGGAAGTAAAGTGAGAGCCCCTTATGTCAGTGAGGATGGACGGATCCTA 
ATAGACAAAAGTGGAAAATTGGAACTCCAGATGGCTGATAGTTTTGACACAGGCGTATATCACTGTATA 
AGCAGCAATTATGATGATGCAGATATTCTCACCTATAGGATAACTGTGGTAGAACCTTTGGTCGAAGCC 
TATCAGGAAAATGGGATTCATCACACAGTTTTCATTGGTGAAACACTTGATCTTCCATGCCATTCTACT 
GGTATCCCAGATGCCTCTATTAGCTGGGTTATTCCAGGAAACAATGTGCTCTATCAGTCATCAAGAGAC 
AAGAAAGTTCTAAACAATGGCACATTAAGAATATTACAGGTCACCCCGAAAGACCAAGGTTATTATCGC 
TGTGTGGCAGCCAACCCATCAGGGGTTGATTTTTTGATTTTCCAAGTTTCAGTCAAGATGAAAGGACAA 
AGGCCCTTGGAGCATGATGGAGAAACAGAGGGATCTGGACTTGATGAGTCCAATCCTATTGCTCATCTT 
AAGGAGCCACCAGGTGCACAACTCCGTACATCTGCTCTGATGGAGGCTGAGGTTGGAAAACACACCTCA 
AGCACAAGTAAGAGGCACAACTATCGGGAATTAACACTCCAGCGACGTGGAGATTCAACACATCGACGT 
TTTAGGGAGAATAGGAGGCATTTCCCTCCCTCTGCTAGGAGAATTGACCCACAACATTGGGCGGCACTG 
TTGGAGAAAGCTAAAAAGAATGCTATGCCAGACAAGCGAGAAAATACCACAGTGAGCCCACCCCCAGTG 
GTCACCCAACTCCCAAACATACCTGGTGAAGAAGACGATTCCTCAGGCATGCTCGCTCTACATGAGGAA 
TTTATGGTCCCGGCCACTAAAGCTTTGAACCTTCCAGCAAGGACAGTGACTGCTGACTCCAGAACAATA 
TCTGATAGTCCTATGACAAACATAAATTATGGCACAGAATTCTCTCCTGTTGTGAATTCACAAATACTA 
CCACCTGAAGAACCCACAGATTTCAAACTGTCTACTGCTATTAAAACTACAGCCATGTCAAAGAATATA 
AACCCAACCATGTCAAGCCAAATACAAGGCACAACCAATCAACATTCATCCACTGTCTTTCCACTGCTA 
CTTGGAGCAACTGAATTTCAGGACTCTGACCAGATGGGAAGAGGAAGAGAGCATTTCCAAAGTAGACCC 
CCAATAACAGTAAGGACTATGATCAAAGATGTCAATGTCAAAATGCTTAGTAGCACCACCAACAAACTA 
TTATTAGAGTCAGTAAATACCACAAATAGTCATCAGACATCTGTAAGAGAAGTGAGTGAACCCAGGCAC 
AATCACTTCTATTCTCACACTACTCAAATACTTAGCACCTCCACGTTCCCTTCAGATCCACACACAGCT 
GCTCATTCTCAGTTTCCGATCCCTAGAAATAGTACAGTTAACATCCCGCTGTTCAGACGCTTTGGGAGG 
CAGAGGAAAATTGGCGGAAGGGGGCGGATTATCAGCCCATATAGAACTCCAGTTCTGCGACGGCATAGA 
TACAGCATTTTCAGGTCAACAACCAGAGGTTCTTCTGAAAAAAGCACTACTGCATTCTCAGCCACAGTG 
CTCAATGTGACATGTCTGTCCTGTCTTCCCAGGGAGAGGCTCACCACTGCCACAGCAGCATTGTCTTTT 
CCAAGTGCTGCTCCCATCACCTTCCCCAAAGCTGACATTGCTAGAGTCCCATCAGAAGAGTCTACAACT 
CTAGTCCAGAATCCACTATTACTACTTGAGAACAAACCCAGTGTAGAGAAAACAACACCCACAATAAAA 
TATTTCAGGACTGAAATTTCCCAAGTGACTCCAACTGGTGCAGTCATGACATATGCTCCAACATCCATA 
CCCATGGAAAAAACTCACAAAGTAAACGCCAGTTACCCACGTGTGTCTAGCACCAATGAAGCTAAAAGA 
GATTCAGTGATTACATCGTCACTTTCAGGTGCTATCACCAAGCCACCAATGACTATTATAGCCATTACA 
AGGTTTTCAAGAAGGAAAATTCCCTGGCAACAGAACTTTGTAAATAACCATAACCCAAAAGGCAGATTA 
AGGAATCAACATAAAGTTAGTTTACAAAAAAGCACAGCTGTGATGCTTCCTAAAACATCTCCTGCTTTA 
CCACAGAGACAAAGTTCCCCTTTCCATTTCACCACACTTTCAACAAGTGTGATGCAAATTCCATCTAAT 
ACCTTGACTACCGCTCACCACACTACGACCAAAACACACAATCCTGGAAGTCTTCCAACAAAGAAGGAG 
CTTCCCTTCCCACCCCTTAACCCTATGCTTCCTAGTATTATAAGCAAAGACTCAAGTACAAAAAGCATC 
ATATCAACGCAAACAGCAATACCAGCAACAACTCCTACCTTCCCTGCATCTGTCATCACTTATGAAACC 
CAAACAGAGAGATCTAGAGCACAAACAATACAAAGAGAACAGGAGCCTCAAAAGAAGAACAGGACTGAC 
CCAAACATCTCTCCAGACCAGAGTTCTGGCTTCACTACACCCACTGCTATGACACCTCCTGCTCTGGCA 
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TTCACTCATTCCCa^AGAAAACACAACTGGGATTTCAAGCACAATCJWTTTTCATTCAAGAACTCTT 
AATCTGACAGATGTGATTGAAGAACTAGCCCAAGCAAGTACTCAGACTTTGAAGAGCACAATTGCTTCT 
GAAACAACTTTGTCCAGCAAATCACACCAGAGTACCACAACTAGGAAAGCATCATTAGACACTCCCATA 
CCACCATTCTTGAGCAGCAGTGCTACTCTAATGCCAGTTCCCATCTCCCCTCCCTTTACTCAGAGAGCA 
GTTACTGACACACGTGGCGACTCCCATTTCCGGCTTATGACAAATACAGTGGTCAAGCTGCACGAATCC 
TCAAGGCACAATCTCCAAATGCCAAGTTCACAATTGGAACCACTCACTTCATCTACCTCTAATCTGTTA 
CATTCTACTCCCATGCCAGCACTAACAACAGTTAAATCACAGAATTCCAAATTAACTCCATCTCCCTGG 
GCAGAATACCAATTTTGGCACAAACCATACTCAGACATTGCTGAAAAAGGCAAAAAGCCAGAAGTAAGC 
ATGTTGGCTACTACAGGCCTGTCCGAGGCCACCACTCTTGTTTCAGATTGGGATGGACAGAAGAACACA 
AAGAAGAGTGACTTTGATAAGAAACCAGTTCAAGAAGCAACAACTTCCAAACTCCTTCCCTTTGACTCT 
TTGTCTAGGTATATATTTGAAAAGCCCAGGATAGTTGGAGGAAAAGCTGCAAGTTTTACTATTCCAGCT 
AACTCAGATGCCTTTCTTCCCTGTGAAGCTGTTGGAAATCCCCTGCGCACCATTCATTGGACCAGAGTT 
TCAGGACTTGATTTATCTAGAGGAAACCAGAATAGCAGGGTCCAGGTTCTCCCCAATGGTACCCTGTCC 
ATCCAGAGGGTGGAAATTCAGGACCGCGGACAGTACTTGTGTTCCGCATCCAATCTGTTTGGCACAGAC 
CACCTTCATGTCACCTTGTCTGTGGTTTCCTATCCTCCCAGGATCCTGGAGAGACGTACCAAAGAGATC 
ACAGTTCATTCCGGAAGCACTGTGGAACTGAAGTGCAGAGCAGAAGGTAGGCCAAGCCCTACAGTTACC 
TGGATTCTTGCAAACCAAACAGTTGTCTCAGAATCATCCCAGGGAAGTAGGCAGGCTGTGGTGACGGTT 
GACGGAACATTGGTCCTCCACAATCTCAGTATTTATGACCGTGGCTTTTACAAATGTGTGGCCAGCAAC 
CCAGGTGGCCAGGATTCACTGCTGGTTAAAATACAAGTCATTGCAGCACCACCTGTTATTCTAGAGCAA 
AGGAGGCAAGTCATTGTAGGCACTTGGGGTGAAAGTTTAAAACTGCCCTGTACTGCAAAAGGAACTCCT 
CAGCCCAGCGTTTACTGGGTCCTCTCTGATGGCACTGAAGTGAAACCATTACAGTTTACCAATTCCAAG 
TTGTTCTTATTTTCAAATGGGACTTTGTATATAAGAAACCTAGCCTCTTCAGACAGGGGCACTTATGAA 
TGCATTGCTACCAGTTCCACTGGTTCGGAGCGAAGAGTAGTAATGCTTACAATGGAAGAGCGAGTGACC 
AGCCCCAGGATAGAAGCTGCATCCCAGAAAAGGACTGAAGTGAATTTTGGGGACAAATTACTACTGAAC 
TGCTCAGCCACTGGGGAGCCCAAACCCCAAATAATGTGGAGGTTACCATCCAAGGCTGTGGTCGACCAG 
TGGAGCTGGATCCACGTCTACCCTAATGGATCCCTGTTTATTGGATCAGTAACAGAAAAAGACAGTGGT 
GTCTACTTGTGTGTGGCAAGAAACAAAATGGGGGATGATCTGATACTGATGCATGTTAGCCTAAGACTG 
AAACCTGCCAAAATTGACCACAAGCAGTATTTTAGAAAGCAAGTGCTCCATGGGAAAGATTTCCAAGTA 
GATTGCAAAGCTTCCGGCTCCCCAGTGCCAGAGATATCTTGGAGTTTGCCTGATGGAACCATGATCAAC 
AATGCAATGCAAGCCGATGACAGTGGCCACAGGACTAGGAGATATACCCTTTTCAACAATGGAACTTTA 
TACTTCAACAAAGTTGGGGTAGCGGAGGAAGGAGATTATACTTGCTATGCCCAGAACACCCTAGGGAAA 
GATGAAATGAAGGTCCACTTAACAGTTATAACAGCTGCTCCC CGGATAAGGCAGAGTAACAAAAC CAAC 
AAGAGAATCAAAGCTGGAGACACAGCTGTCCTTGACTGTGAGGTCACTGGGGATCCCAAACCAAAAATA 
TTTTGGTTGCTGCCTTCCAATGACATGATTTCCTTCTCCATTGATAGGTACACATTTCATGCCAATGGG 
TCTTTGACCATCAACAAAGTGAAACTGCTCGATTCTGGAGAGTACGTATGTGTAGCCCGAAATCCCAGT 
GGGGATGACACCAAAATGTACAAACTGGATGTGGTCTCTAAACCTCCATTAATCAATGGTCTGTATACA 
AACAGAACTGTTATTAAAGCCACAGCTGTGAGACATTCCAAAAAACACTTTGACTGCAGAGCTGAAGGG 
ACACCATCTCCTGAAGTCATGTGGATCATGCCAGACAATATTTTCCTCACAGCCCCATACTATGGAAGC 
AGAATCACAGTCCATAAAAATGGAACCTTGGAAATTAGGAATGTGAGGCTTTCAGATTCAGCCGACTTT 
ATCTGTGTGGCCCGAAATGAAGGTGGAGAGAGCGTGTTGGTAGTACAGTTAGAAGTACTGGAAATGCTG 
AGAAGACCGACATTTAGAAATCCATTTAATGAAAAAATAGTTGCCCAGCTGGGAAAGTCCACAGCATTG 
AATTGCTCTGTTGATGGTAACCCACCACCTGAAATAATCTGGATTTTACCAAATGGCACACGATTTTCC 
AATGGACCACAAAGTTATCAGTATCTGATAGCAAGCAATGGTTCTTTTATCATTTCTAAAACAACTCGG 
GAGGATGCAGGAAAATATCGCTGTGCAGCTAGGAATAAAGTTGGCTATATTGAGAAATTAGTCATATTA 
GAAATTGGCCAGAAGCCAGTTATTCTTACCTATGCACCAGGGACAGTAAAAGGCATCAGTGGAGAATCT 
CTATCACTGCATTGTGTGTCTGATGGAATCCCTAAGCCAAATATCAAATGGACTATGCCAAGTGGTTAT 
GTAGTAGACAGGCCTCAAATTAATGGGAAATACATATTGCATGACAATGGCACCTTAGTCATTAAAGAA 
GCAACAGCTTATGACAGAGGAAACTATATCTGTAAGGCTCAAAATAGTGTTGGTCATACACTGATTACT 
GTTCCAGTAATGATTGTAGCCTACCCTCCCCGAATTACAAATCGTCCACCCAGGAGTATTGTCACCAGG 
ACAGGGGCAGCCTTTCAGCTCCACTGTGTGGCCTTGGGAGTTCCCAAGCCAGAAATCACATGGGAGATG 
CCTGACCACTCCCTTCTCTCAACGGCAAGTAAAGAGAGGACACATGGAAGTGAGCAGCTTCACTTACAA 
GGTACCCTAGTCATTCAGAATCC C CAAAC CTCCGATTCTGGGATATACAAATGCACAGCAAAGAACCCA 
CTTGGTAGTGATTATGCAGCAACGTATATTCAAGTAATCTGA 
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VATPGGKACP RRCACYMPTE VHCTFRYLTS 
RLMETDFSGL TKLELLMLHS NGIHTIPDKT 
QKDTFYGLRS LTRLHMDHNN IEFINPEVFY 
DTFVSLSYLQ IFKISFIKFL YLSDNFLTSL 
WTCDCHLKWL SDWIQKKPDV IKCKKDRSPS 
MVSAAAFQCA KPTIDSSLKS KSLTILEDSS 
TDQSGNEANM VCSIQKPSRT SPIAFTEEND 
IQPVWQILAL YSDSPLILER SHLLSETPQL 
DLRADPSWLM QDQISLQLNR TATTFSTLQI 
KWTMISRDNN TKLEHTVLVG GTVGLNCPGQ 
YVSEDGRILI DKSGKLELQM ADSFDTGVYH 
PLVEAYQENG IHHTVFIGET LDLPCHSTGI 
DKKVLNNGTL RILQVTPKDQ GYYRCVAANP 
EHDGETEGSG LDESNPIAHL KEPPGAQLRT 
YRELTLQRRG DSTHRRFREN RRHFPPSARR 
KRENTTVSPP PWTQLPNIP GEEDDSSGML 
VTADSRTISD SPMTNINYGT EFSPWNSQI 
SKNINPTMSS QIQGTTNQHS STVFPLLLGA 
PITVRTMIKD VNVKMLSSTT NKLLLESVNT 
SHTTQILSTS. TFPSDPHTAA HSQFPIPRNS 
RIISPYRTPV LRRHRYSIFR STTRGSSEKS 
RLTTATAALS FPSAAPITFP KADIARVPSE 
KTTPTIKYFR TEISQVTPTG AVMTYAPTS I 
AKRDSVITSS LSGAITKPPM TIIAITRFSR 
NQHKVSLQKS TAVMLPKTSP ALPQRQSSPF 
HHTTTKTHNP Gslj p TKKELj p FPPLNPMLPS 

TTPTFPASVI TYETQTERSR AQTIQREQEP 
PTAMTPPALA FTHSPPENTT GISSTISFHS 
KSTIASETTL SSKSHQSTTT RKASLDTPIP 
RAVTDTRGDS HFRLMTNTW KLHESSRHNL 
TPMPALTTVK SQNSKLTPSP WAEYQFWHKP 
LSEATTLVSD WDGQKNTKKS DFDKKPVQEA 
IVGGKAASFT IPANSDAFLP CEAVGNPLPT 
QVLPNGTLSI QRVEIQDRGQ YLCSASNLFG 
RTKEITVHSG STVELKCRAE GRPSPTVTWI 
VDGTLVLHNL SIYDRGFYKC VASNPGGQDS 
VIVGTWGESL KLPCTAKGTP QPSVYWVLSD 
TLYIRNLASS DRGTYECIAT SSTGSERRW 
TEVNFGDKLL LNCSATGEPK PQIMWRLPSK 
SVTEKDSGVY LCVARNKMGD DLILMHVSLR 
DFQVDCKASG SPVPEISWSL PDGTMINNAM 
YFNKVGVAEE GDYTCYAQNT LGKDEMKVHL 
AGDTAVLDCE VTGDPKPKIF WLLPSNDiMIS 
LLDSGEYVCV ARNPSGDDTK MYKLDWSKP 
SKKHFDCRAE GTPSPEVMWI MPDNIFLTAP 
LSDSADFICV ARNEGGESVL WQLEVLEML 
TALNCSVDGN PPPEIIWILP NGTRFSNGPQ 
DAGKYRCAAR NKVGYIEKLV ILEIGQKPVI 
VSDGIPKPNI KWTMPSGYW DRPQINGKYI 
ICKAQNSVGH TLITVPVMIV AYPPRITNRP 
VPKPEITWEM PDHSLLSTAS KERTHGSEQL 
CTAKNPLGSD YAATYIQVI* 
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OCP rat amino acid sequence 

>608-663Nterm Rat Protein (663 aa) 

MQVRGREVSGLIj I SLTAVCLWTPGS RACPRRCAC YVPTEVHCT FR YLTS I PDGI PANVE 
RINLGYNSLTRLTENDFDGLSKLELLMLHSNGIHRVSDKTFSGLQSLQVLKMSYNKVQII 
RKDTFYGLGSLVRLHLDHNNIEFINPEAFYGLTSLRLVHLEGNRLTKLHPDTFVSLSYLQ 
IFKTSFIKYLFLSDNFLTSLPKEMVSYMPNLESLYLHGNPWTCDCHLKWLSEWMQGNPDI 
IKCKKDRSSSSPQQCPLCMNPRISKGRPFAMVPSGAFLCTKPTIDPSLKSKSLVTQEDNG 
SASTSPQDFIEPFGSLSLNMTXXSGNKADMVCSIQKPSRTSPTAFTEENDYIMLNASFST 
NLVCSVDYNHIQPVWQLLALYSDSPLILERKPQLTETPSLSSRYKQVALRPEDIFTSIEA 
DVRADPFWFQQEKIVLQLNRTATTLSTLQIQFSTDAQIALPRAEMRAERLKWTMILMMNK 
PKLERTVLVGGTIALSCPGKGDPSPHLEWLLADGSKVRAPYVSEDGRILIDKNGKLELQM 
ADS FDAGL YH CIS TNDADAD VLT Y R I T WE P YGE S THD S G VQHT WTGETLDLP C L S TG V 
PDASISWILPGNTVFSQPSRDRQILNNGTLRILQVTPKDQGHYQCVAANPSGADFSSFKV 
SVQ 
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